A/B Testing

A statistical way to compare two or more techniques
to determine which performs better and to
understand whether the difference is statistically
significant.


Abduction 
See "Abductive Reasoning." 


Abductive Logic Programming (ALP) 
A framework for declarative problem solving with 
respect to higher-level knowledge representation. 


Abductive Reasoning 

The process of inferring probable conclusions 
based on information that is known. Also called 
"Abduction." 


Ablation Study 

A scientific examination to get an insight into the 
effects of a machine learning system's building 
blocks on its overall performance. 


Abstract Datatype 
A set of data values and related operations that are 
independent of any implementation. 


Accountability 
A state of being accountable for something based 
on regulation, agreement, or assignment. 


Accuracy 
The fraction of predictions that a classification
model got right.
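
As a minimal sketch of this definition, the function below divides the number of matching predictions by the total number of predictions. The function name `accuracy` and the sample labels are hypothetical and chosen only for illustration.

```python
def accuracy(y_true, y_pred):
    """Return the fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one prediction is required")
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


# Hypothetical example data: 4 of the 5 predictions are correct.
labels = ["cat", "dog", "dog", "cat", "dog"]
predictions = ["cat", "dog", "cat", "cat", "dog"]
print(accuracy(labels, predictions))  # 0.8
```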


Action 
A mechanism in reinforcement learning by which the
agent transitions between the environment states.


Action Learning 
See "Action Model Learning." 


Action Model Learning 

A branch of machine learning that focuses on
developing and changing a software agent's 
understanding of the consequences and 
preconditions of behavior in its environment. 
Also called "Action Learning." 


Action Recognition 
A computer vision task that analyzes videos
to identify human actions. Also called "Human
Action Recognition."


Action Selection
The process by which an agent decides what to do next.


Activation Function 

A mathematical formula used in an artificial neural
network to process the weighted inputs of a
node and generate its output.
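
The sketch below illustrates the idea with two widely used activation functions, the sigmoid and the rectified linear unit (ReLU). The helper `neuron_output` and the sample inputs, weights, and bias are hypothetical values chosen for illustration.

```python
import math


def sigmoid(z):
    """Squash any real number into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def relu(z):
    """Pass positive values through and replace negative values with 0."""
    return max(0.0, z)


def neuron_output(inputs, weights, bias, activation):
    """Compute the weighted sum of the inputs, then apply the activation."""
    z = sum(x * w for x, w in zip(inputs, weights)) + bias
    return activation(z)


# Hypothetical node: the weighted sum is 1.0*0.5 + 2.0*(-0.25) + 0.0 = 0.0
inputs = [1.0, 2.0]
weights = [0.5, -0.25]
bias = 0.0
print(neuron_output(inputs, weights, bias, sigmoid))  # 0.5
print(neuron_output(inputs, weights, bias, relu))     # 0.0
```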


Activation Map 
See "Feature Map." 


Active Learning 

A semi-supervised learning approach where the 
algorithm selects some of the data it needs for 
learning. 


Ad Targeting 

An advanced method for advertising that targets 
ads at the audiences who are most likely receptive 
to the offerings. Also called "Targeted Advertising 
(or Targeted Ads)." 


AdaGrad 

An advanced gradient descent algorithm that
adapts the learning rate of each parameter using
knowledge of the data geometry from previous
iterations.
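
A minimal sketch of the commonly described AdaGrad update follows: each parameter keeps a running sum of its squared gradients, and its step is divided by the square root of that sum. The function name `adagrad_minimize`, the test function, and the hyperparameter values are assumptions made for illustration.

```python
import math


def adagrad_minimize(grad_fn, params, learning_rate=0.5, steps=200, eps=1e-8):
    """Minimize a function with AdaGrad, given a function returning its gradient."""
    params = list(params)
    accum = [0.0] * len(params)  # running sum of squared gradients per parameter
    for _ in range(steps):
        grads = grad_fn(params)
        for i, g in enumerate(grads):
            accum[i] += g * g
            # Parameters with a history of large gradients take smaller steps.
            params[i] -= learning_rate * g / (math.sqrt(accum[i]) + eps)
    return params


def grad(p):
    """Gradient of the hypothetical function f(x, y) = (x - 3)^2 + 10 * (y + 1)^2."""
    x, y = p
    return [2.0 * (x - 3.0), 20.0 * (y + 1.0)]


result = adagrad_minimize(grad, [0.0, 0.0])
print(result)  # close to the minimum at [3.0, -1.0]
```

Because the two coordinates have very different gradient scales, a single fixed learning rate would suit one and not the other; the per-parameter scaling is what lets both converge here.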


Adaptive Algorithm 

An algorithm that changes its behavior at run time 
based on available information and predefined 
criteria. 


Adaptive Boosting (AdaBoost) 
A boosting technique that combines multiple
weak classifiers to create a strong classifier. 


Adaptive Machine Learning 
A way to update and retrain machine learning
models incrementally to adapt to new changes. 


Adaptive Neuro-Fuzzy Inference System (ANFIS) 
A technique that combines fuzzy logic and
neural networks to enhance the applicability and 
performance of a prediction. 


Admissible Heuristic 
A function that never overestimates the cost of 
reaching the goal in a search algorithm. 


Advanced Driver Assistance Systems (ADAS) 
See "Driver Assistance." 


Adversarial Example 

An input that has been deliberately designed
to trick a machine learning model into making a
mistake.


Adversarial Machine Learning

A field that focuses on designing effective 
machine learning algorithms against security 
challenges. 


Affective Computing 

A subfield of artificial intelligence concerned with 
the development of systems capable of recogniz- 
ing and processing human emotions. Also called 
"Emotion Artificial Intelligence (or Emotional Arti- 
ficial Intelligence)." 


Agent 
A physical or software entity that can perceive its
environment and act autonomously to accomplish 
a goal. 


Agent Architecture 

A blueprint of the agent's control system that illus- 
trates the control components and the functional 
organization. 


Agglomerative Clustering 

A method of creating a hierarchical tree by
assigning each example to its own cluster and
iteratively merging the closest clusters.


Algorithm 
A set of well-defined instructions to solve a prob- 
lem or perform a task. 


Algorithmic Bias 
See "Bias." 


Algorithmic Efficiency 
A metric to measure the average execution time of 
an algorithm to complete working on a set of data. 


Algorithmic Learning Theory 

An area of computational learning theory that 
follows non-statistical and non-probabilistic ap- 
proaches to learning limits. 


Algorithmic Probability 
A method of assigning a prior probability to ob- 
jects. Also called "Solomonoff Probability." 


Ambient Intelligence (AmI)
A digital environment that is sensitive to people 
and supports their daily lives. 


Analysis of Algorithms 

The process of determining the computational 
complexity and performance of algorithms in order 
to evaluate their application suitability or compare 
them with other algorithms.


Analytics

A general term that is usually used to describe a
variety of business intelligence and other related
initiatives, such as data analytics and web
analytics.


Analytics Platform 
A unified technology solution that enables the 
full cycle of analytics processes, including data 
preparation, data processing, data storage, and 
data management. 


Annotator 
A worker who can assist in labeling data (e.g., im- 
ages) on-demand. Also called "Labeler" or "Rater." 


Anomaly Detection 

The process of identifying unusual or abnormal ob- 
servations in the data because of their different char- 
acteristics from the majority of the processed data. 


Answer Set Programming (ASP) 
A type of declarative programming for solving 
complex search problems. 


Anytime Algorithm 

An algorithm that gives a valid result even if it is in- 
terrupted before it ends, and the quality of results 
improves gradually as long as it runs. 


Application Programming Interface (API) 

A group of commands, functions, protocols, and 
objects that programmers use to communicate 
with external systems. 


Approximate String Matching 
A process of finding an approximate match for 
strings instead of an exact match. 


Approximation Error 
The distance between the target predictor and the 
hypothesis class. 


Area under the ROC Curve (AUC) 

An evaluation metric used to determine a model 
performance by considering all possible classifi- 
cation thresholds. 


Argumentation Framework 

A graph that has nodes representing arguments
and edges representing the conflicts between
these arguments.


Artificial General Intelligence (AGI)

A computational system that can perform any
intellectual task a human can perform, such as
problem-solving, creativity, and adaptability. Also
called "General Artificial Intelligence" or "Strong
Artificial Intelligence."


Artificial Immune System (AIS) 
An adaptive system for problem-solving inspired 
by the human immune system. 


Artificial Intelligence (AI) 

A computer science field that focuses on building 
systems capable of performing tasks that usually 
require human intelligence, such as learning, rea- 
soning, and self-development. Also called "Ma- 
chine Intelligence." 


Artificial Intelligence Ethics 

A set of values, principles, and techniques to guide
moral conduct in developing and using AI
technologies.


Artificial Intelligence Markup Language (AIML) 
An Extensible Markup Language (XML) based 
markup language for developing natural language 
applications. 


Artificial Intelligence System 
A system capable of performing tasks that usually 
require human intelligence, such as learning, rea- 
soning, and self-development. 


Artificial Narrow Intelligence (ANI) 
See "Narrow Artificial Intelligence (NAI)." 


Artificial Neural Network (ANN) 

A computational model in AI that is inspired by the
biological neural networks of animal brains. Also 
called "Neural Network (NN)." 


Artificial Neuron 
See "Neuron." 


Artificial Superintelligence (ASI) 

See "Superintelligence." 

Association for the Advancement of Artificial Intelligence (AAAI)

A non-profit scientific society dedicated to
advancing scientific research, public understanding,
practitioner training, and responsible use of AI.


Associative Learning

A rule-based technique in machine learning for 
discovering significant relationships between fea- 
tures or variables in a dataset. 


Attention Mechanism 
See "Attention Model." 


Attention Model 

A technique in a neural network that allows the
network to focus on one aspect of a complex input
at a time until the whole dataset is categorized.
Also called "Attention Mechanism."


Attribute 

A property or characteristic of an object that
humans or automated methods can identify. Often
used as a synonym for "Feature." 


Audio Signal Processing

A subfield of signal processing that focuses on
analyzing and manipulating audio signals.


Augmented Intelligence 

A design pattern that complements human intel- 
ligence and helps humans get smarter and faster 
in performing tasks. Also called "Intelligence Aug- 
mentation." 


Augmented Research 
The application of AI in analyzing data and testing
hypotheses to support research in different fields. 


Autoencoder 

A type of artificial neural network used to produce 
efficient data representations for unsupervised 
learning. 


Automated Journalism 
See "Robot Journalism." 


Automated Machine Learning (AutoML) 
The process of automating machine learning tasks 
to build, train, test, and deploy models. 


Automated Planning and Scheduling 
See "Planning." 


Automated Reasoning 

A process that provides machine learning algo- 
rithms with an organized framework to define and 
solve problems. 


Automated Vehicle 
See "Autonomous Vehicle."


Automatic Speech Recognition (ASR)
See "Speech Recognition." 


Automatic Summarization 
A process of shortening a part of natural language 
while keeping the important semantics. 


Automation 
A process or a system that runs without human 
intervention. 


Automation Bias 

The favoritism of a human decision-maker to the 
recommendations produced by an automated de- 
cision-making system over information produced 
without automation, even when the automated 
decision-making system makes mistakes. 


Autonomic Computing 
The ability of a system to self-manage its own 
resources without user intervention. 


Autonomous Car 
See "Autonomous Vehicle." 


Autonomous Driving 
The process of sensing the surroundings and mov- 
ing independently and self-sufficiently. 


Autonomous Robot

A robot that performs tasks with a high degree of
autonomy.


Autonomous Vehicle 

A vehicle capable of sensing its surroundings and 
moving independently and self-sufficiently. Also 
called "Automated Vehicle." 


Autonomy (or Autonomous) 
The ability of a system to act based on its own 
rules and learning methods. 


Availability 
The property of being accessible when required 
by an authorized entity. 


Average Precision 

A metric that summarizes the performance of a 
ranked sequence of results by calculating the av- 
erage precision values for each relevant result. 
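
A minimal sketch of one common way to compute this metric: walk down the ranked list, record the precision at every position that holds a relevant result, and average those values. The function name `average_precision` is hypothetical, and the sketch assumes every relevant item appears somewhere in the ranked list.

```python
def average_precision(ranked_relevance):
    """Average the precision measured at the rank of each relevant result.

    ranked_relevance is a list of booleans, best-ranked result first.
    """
    hits = 0
    precision_sum = 0.0
    for rank, is_relevant in enumerate(ranked_relevance, start=1):
        if is_relevant:
            hits += 1
            precision_sum += hits / rank  # precision at this rank
    return precision_sum / hits if hits else 0.0


# Relevant results at ranks 1 and 3: (1/1 + 2/3) / 2
ap = average_precision([True, False, True, False])
print(round(ap, 4))  # 0.8333
```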


Backpropagation

An algorithm used in training artificial neural
networks to reduce errors by computing the gradient
of the error with respect to the network's weights,
which is then used to update those weights. Short for
"backward propagation of errors."


Backpropagation Through Time (BPTT)
A method to apply backpropagation to recurrent 
neural networks. 


Backward Chaining 

A logical reasoning approach that goes backward 
from the goal to specify the data or proof that 
justifies it. 

Bag of Words 

A representation of words in a text that describes 
their occurrence within a document regardless of 
their order. 
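
The sketch below builds such a representation with a word counter, assuming a simple lowercase tokenizer; real systems typically use more careful tokenization. The function name `bag_of_words` and the sample sentences are hypothetical.

```python
import re
from collections import Counter


def bag_of_words(text):
    """Count how often each word occurs, ignoring word order."""
    tokens = re.findall(r"[a-z0-9']+", text.lower())
    return Counter(tokens)


a = bag_of_words("the dog chased the cat")
b = bag_of_words("the cat chased the dog")
print(a["the"])  # 2
print(a == b)    # True: the sentences differ only in word order
```

The second result shows the main limitation of the representation: two sentences with different meanings can map to the same bag.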


Baseline Model 
A reference model to compare how well another more
complex model is performing on a specific problem. 


Batch 
A group of examples used for model training. 


Batch Inference
See "Offline Inference." 


Batch Normalization 

A supervised learning technique for improving neu- 
ral network performance and stability by converting 
interlayer outputs into a standard format. 


Batch Processing 
The processing of a set of data or programs with 
minimal human interaction. 


Batch Size 
The number of samples passed to the machine learn- 
ing model at each iteration for training the model. 


Bayes's Law 
See "Bayes's Theorem." 


Bayes's Theorem 

A mathematical formula for calculating conditional
probabilities; it gives the probability of an
event based on prior knowledge of a related
event. Also called "Bayes's Law" or "Bayes's Rule."
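
As a worked sketch, the theorem states that P(A|B) = P(B|A) P(A) / P(B), where P(B) can be expanded as P(B|A) P(A) + P(B|not A) P(not A). The function name `posterior` and the numbers in the example (a 1% prior, a 90% detection rate, and a 5% false positive rate) are hypothetical.

```python
def posterior(prior, likelihood, false_positive_rate):
    """Return P(A | B) using Bayes's theorem.

    prior               = P(A)
    likelihood          = P(B | A)
    false_positive_rate = P(B | not A)
    """
    evidence = likelihood * prior + false_positive_rate * (1.0 - prior)  # P(B)
    return likelihood * prior / evidence


# Hypothetical test for a rare condition: a positive result raises the
# probability from 1% to roughly 15%, not to 90%.
p = posterior(prior=0.01, likelihood=0.9, false_positive_rate=0.05)
print(round(p, 4))  # 0.1538
```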


Bayes's Rule
See "Bayes's Theorem." 


Bayesian Network 
A graphical model that represents variables and their 
conditional dependencies using Bayesian inference. 


Bayesian Neural Network 

A probabilistic neural network that relies on Bayes's
Theorem to calculate uncertainties in weights and
outputs.


Bayesian Programming

A statistical method for building probability models 
and solving open-ended problems with incom- 
plete information. 


Bees Algorithm 

An algorithm that simulates the behavior of bee 
swarms in finding food by random search and can 
be used for optimization problems. 


Behavior Informatics (BI) 

A field concerned with gathering, analyzing, and 
interpreting heterogeneous data to represent and
model human behavior. 


Behavior Tree (BT) 
A method for structuring the switch between var- 
ious actions in an autonomous agent. 


Behavioral Analytics 
The process of using data about users' behaviors 
to understand and predict their intents and actions. 


Belief-Desire-Intention Software Model (BDI)
A software model used for programming and im- 
plementing intelligent agents. 


Bellman Equation 

A recursive formula used in many reinforcement 
learning algorithms to compute the expected total 
reward of taking an action.


Bias 

A favoritism or prejudice of an AI system towards
some groups over others. Also called "Algorithmic 
Bias." 


Bias-Variance Tradeoff 

A conflict that arises when trying to minimize bias 
and variance, keeping supervised algorithms from 
generalizing beyond their training set. 


Bidirectional 
A system in text analysis that assesses the text that 
comes before and after a specific part of a text. 


Bidirectional Encoder Representations from Transformers (BERT)
A deep learning technique based on transformer 
architecture for natural language processing (NLP). 


Bidirectional Language Model 

A model that defines the probability of a word 
presence in a text based on the text that comes 
before and after it.


Big Data

A large dataset that requires scalable technologies 
for storage, processing, management, and anal- 
ysis due to its characteristics of volume, variety, 
velocity, and variability. 


Big Data Analytics 

The process of examining a large amount of di- 
verse data to discover patterns, correlations, and 
trends that provide insights and support deci- 
sion-making. 


Big O Notation 
A theoretical measure of an algorithm's required
execution time or memory, given the problem size.


Bigram 
A sequence of two words in a text. 


Bilingual Evaluation Understudy (BLEU) 

A measure to automatically evaluate the quality
of machine-translated text from one language 
to another. 


Binary Classification 
A type of classification that predicts only one out 
of two possible classes. 


Binary Tree 
A tree structure in which each node has at most 
two children. 


Binning 
See "Bucketing." 


Biometrics 

The measurable biological and behavioral charac- 
teristics of individuals used for automated recog- 
nition and authentication. 


Black Box 

A metaphor used to describe a system or an al- 
gorithm with unknown internal structure or hidden 
workings, leading to a lack of understanding of 
how it works. 


Boltzmann Machine 

A neural network of nodes that are symmetrically 
connected and make their own decisions about 
activation. 


Boosting 

An approach to reduce bias and variance in
supervised learning and to convert machine learning
algorithms from weak learners to strong ones.


Bootstrap Aggregating (Bagging)

An ensemble learning method used to improve 
the stability and accuracy of the classification and 
regression models. 


Bot 
An automated software program that performs 
repetitive tasks to simulate human activity. 


Bounding Box 
The coordinates of a rectangle around an area of 
interest or an object. 


Branching Factor 
The number of descendants that a node generates. 


Broadcasting 

A method of matching the dimensions of arrays of 
different sizes in order to allow performing arith- 
metic operations between those arrays. 
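
The sketch below computes the shape that results from broadcasting two arrays, following the rule used by NumPy-style array libraries: shapes are aligned from the last dimension, and two dimensions are compatible when they are equal or one of them is 1. The function name `broadcast_shape` is hypothetical.

```python
from itertools import zip_longest


def broadcast_shape(shape_a, shape_b):
    """Return the shape produced by broadcasting two array shapes together."""
    result = []
    # Align from the trailing dimension; a missing dimension counts as size 1.
    for a, b in zip_longest(reversed(shape_a), reversed(shape_b), fillvalue=1):
        if a == b or b == 1:
            result.append(a)
        elif a == 1:
            result.append(b)
        else:
            raise ValueError(f"shapes {shape_a} and {shape_b} cannot be broadcast")
    return tuple(reversed(result))


# A column of 3 values combined with a row of 4 values gives a 3 x 4 result.
print(broadcast_shape((3, 1), (4,)))  # (3, 4)
```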


Brute Force Search 
A technique to solve a problem by iterating through 
all possible solutions until finding a valid one. 


Bucketing 

A process of transforming a continuous feature 
into multiple binary ones called buckets or bins 
based on a range of values. Also called "Binning." 
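
A minimal sketch of the idea: a numeric value is mapped to the bucket whose range contains it, and the result is encoded as one binary feature per bucket. The function name `bucketize` and the age boundaries are hypothetical.

```python
from bisect import bisect_right


def bucketize(value, boundaries):
    """Convert a continuous value into one binary feature per bucket.

    boundaries must be sorted; n boundaries define n + 1 buckets.
    """
    index = bisect_right(boundaries, value)  # which bucket the value falls in
    one_hot = [0] * (len(boundaries) + 1)
    one_hot[index] = 1
    return one_hot


# Hypothetical age buckets: under 18, 18 to 34, 35 to 64, 65 and over.
age_boundaries = [18, 35, 65]
print(bucketize(42, age_boundaries))  # [0, 0, 1, 0]
```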


Business Intelligence

A data-driven system that incorporates data
collection, data storage, data analysis, and data
visualization to support decision making.


Calibration Layer
A post-prediction adjustment for reducing the 
prediction bias. 


Candidate Generation 
The process of selecting initial recommendations 
by a recommendation system. 


Candidate Sampling 

An optimization method for training by calculating 
a probability for all the target classes and a ran- 
dom sample of other classes. 


Capsule Neural Network (CapsNet) 

An artificial neural network composed of a group 
of neurons (capsules) that represent the
instantiation parameters of a specific entity.


Case-Based Reasoning (CBR)
An approach to solve new problems by using solu- 
tions of past similar problems. 


Categorical Data 
The features that have a discrete set of possible 
values. 


Categorical Variable 
See "Categorical Data." 


Causal Language Model 
See "Unidirectional Language Model." 


Central Limit Theorem 
A theorem stating that the distribution of the means
of independent observations approaches a normal
distribution as the sample size becomes
larger, regardless of the shape of the
population distribution.


Central Processing Unit (CPU) 
An electronic circuit that executes program instruc- 
tions and controls the performance of a computer. 


Centroid 

The center of a cluster that is specified by a clus- 
tering algorithm, such as k-means or k-median 
algorithms. 

Centroid-Based Clustering 

A clustering algorithm that arranges data into 
non-hierarchical clusters represented by a central 
vector or a centroid. 

Chain Rule 

A rule in probability to calculate any joint distribu- 
tion component of a set of random variables by 
using conditional probabilities only. Also used in 
calculus to calculate the derivative of the compo- 
sition of two or more functions. 

Chatbot 

An AI program that utilizes natural language
processing techniques to interact with human users
through audio or text conversations.


Checkpoint 

A practice for saving a snapshot of the variables 
of a model at a given point in time to allow for 
exporting model weights and performing training 
across multiple sessions.


Class
A label that represents one of the enumerated 
target values. 


Class-Imbalanced Dataset 
A problem in classification when the frequencies
of classes in a dataset are significantly different. 


Classification 

A type of machine learning algorithm that identifies
the category to which a specific input belongs.
Also called "Standard Classification." 


Classification Model 
A machine learning model that differentiates
among two or more discrete classes. 


Classification Threshold 
A threshold value used to assign a new observation
to one of the two classes in binary classification.


Classifier 

An algorithm used for categorizing data into sets 
(or classes) based on behavioral and structural 
features. 


Clickstream Analytics 
A type of Web analytics that is used to monitor and
analyze website visitors. 


Clipping 

A normalization technique used to deal with
outliers by capping all feature values above or below
a specific value to a fixed value.
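
A minimal sketch of clipping follows: values below a lower bound are raised to that bound, and values above an upper bound are lowered to it. The function name `clip` and the bounds are hypothetical.

```python
def clip(values, lower, upper):
    """Cap every value to the closed range [lower, upper]."""
    if lower > upper:
        raise ValueError("lower must not exceed upper")
    return [min(max(v, lower), upper) for v in values]


# The outliers -50 and 1200 are capped; values inside the range are unchanged.
print(clip([-50, 3, 7, 1200], lower=0, upper=100))  # [0, 3, 7, 100]
```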


Cloud Computing 

A computing model for enabling access to a scal- 
able set of shared computer resources with the 
self-service provision and on-demand adminis- 
tration. 


Cloud Robotics 

A field of robotics in which robots utilize cloud 
technologies, such as storage and computing, to 
extend their capabilities. 


Cluster 
A set of objects that are of similar features and 
organized by similarities. 


Cluster Analysis 
See "Clustering." 


Clustering 

A method in unsupervised machine learning for 
grouping similar objects into clusters. Also called
"Cluster Analysis."


Co-Adaptation

A problem in neural networks when neurons are
highly dependent on other specific neurons in- 
stead of the behavior of the network as a whole. 


Co-Training 

A semi-supervised learning method used in clas- 
sification when the unlabeled data is more than 
labeled data and the dataset has two independent 
and complementary sets of features. 


COBWEB 
An unsupervised and incremental clustering algo- 
rithm that generates a hierarchy of classes. 


Coefficient 
A constant value that is multiplied by a variable or 
an expression. 


Cognitive Architecture 

A hypothesis on the fixed structures of a human or 
artificial mind and the embodied knowledge and 
skills to produce intelligent behavior. 


Cognitive Computing 

The platforms that are based on the scientific dis- 
ciplines of artificial intelligence to simulate human 
thought processes in a computerized model. 


Cognitive Science 

An interdisciplinary study of the mind and its pro- 
cesses and how they can be simulated in ma- 
chines. 


Cognitive System 
See "Cognitive Computing." 


Cohen's Kappa 

A statistical measure to evaluate the degree of 
agreement between two raters or to evaluate a 
classification model performance. 
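
A minimal sketch of the standard formula, kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement and p_e is the agreement expected by chance from each rater's label frequencies. The function name `cohens_kappa` and the sample ratings are hypothetical.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Return Cohen's kappa for two equally long lists of labels."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty set of items")
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    # Chance agreement: probability that both raters pick the same label at random.
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used a single identical label throughout
    return (observed - expected) / (1.0 - expected)


# Observed agreement is 3/4; chance agreement is (2*1 + 2*3) / 16 = 0.5.
ratings_a = ["yes", "yes", "no", "no"]
ratings_b = ["yes", "no", "no", "no"]
print(cohens_kappa(ratings_a, ratings_b))  # 0.5
```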


Cold Data Storage 
The storage of inactive data that is only accessed 
or used rarely. 


Collaborative Filtering 

A technique used in recommendation systems 
to make predictions about the interests of a user 
based on the interests of other users. 


Collaborative Robot (Cobot) 
A robot designed for close interaction with a
human in a shared workspace.


Combinatorial Optimization

A class of methods concerned with finding an 
optimal solution from a finite set of possibilities, 
when an exhaustive search is not feasible. 


Committee Machine 

A method in machine learning that applies a divide 
and conquer strategy by sharing learning tasks and 
combining responses into a single result. 


Commonsense Knowledge 
The information that helps humans to make sense 
of everyday activities. 


Commonsense Reasoning 

A field of artificial intelligence that focuses on de- 
veloping machines with human abilities to make 
assumptions about typical everyday situations. 


Computational Creativity 

An emerging field in artificial intelligence that fo- 
cuses on the ability of machines to both generate 
and evaluate creative outputs. 


Computational Humor 
A branch of artificial intelligence concerned with
modeling and generating humor. 


Computational Intelligence (CI)

A field that focuses on the biological and
linguistic motivation to develop computational
approaches, including Neural Networks, Fuzzy
Systems, and Evolutionary Computation.


Computational Learning Theory 

A field of artificial intelligence that focuses on all 
aspects of designing and analyzing machine learn- 
ing algorithms to identify learnable problems. Also 
called "Learning Theory." 


Computational Linguistics 
A scientific field concerned with understanding 
human language from a computational perspective. 


Computational Neuroscience 
A discipline that describes the nervous system 
through theoretical and mathematical methods. 


Computer Vision 

A field of artificial intelligence that focuses on
analyzing and understanding visual data (images and
videos). Also called "Machine Vision." 


Computer-Aided Detection (CADe) 
A system that assists clinical diagnosis tasks by 
automated detection in medical records.


Concept Drift
An unexpected change that happens over time 
to the statistical properties of the target variable. 


Conditional Model 
See "Discriminative Model." 


Conditional Probability 
A probability measure of an event occurring on the 
condition that another event has already occurred. 


Confidence Interval 

A type of interval estimate that gives a range
of values expected, at a stated confidence level,
to contain the true value of a quantity estimated
from observed data.


Confirmation Bias 
The tendency of favoring information that confirms 
preexisting beliefs or hypotheses. 


Confusion Matrix 

A matrix that summarizes the performance of a 
classification model and shows how successful 
the predictions were. 
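
The sketch below builds such a matrix by counting (actual, predicted) pairs, with rows for the actual class and columns for the predicted class. The function name `confusion_matrix` here is a hypothetical stand-alone helper, and the spam/ham labels are example data.

```python
from collections import Counter


def confusion_matrix(y_true, y_pred, labels):
    """Return a matrix where entry [i][j] counts items of actual class
    labels[i] that were predicted as class labels[j]."""
    counts = Counter(zip(y_true, y_pred))
    return [[counts[(actual, predicted)] for predicted in labels]
            for actual in labels]


y_true = ["spam", "spam", "ham", "ham", "ham"]
y_pred = ["spam", "ham", "ham", "ham", "spam"]
matrix = confusion_matrix(y_true, y_pred, labels=["spam", "ham"])
print(matrix)  # [[1, 1], [1, 2]]
```

The diagonal entries are the correct predictions, so their sum divided by the total number of entries gives the accuracy (3 of 5 here).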


Connectionism 
A type of cognitive modeling that uses artificial 
neural networks to explain intellectual abilities. 


Consistency 
A degree of standardization and freedom from
contradiction among parts of a system or component.


Constraint Satisfaction 

A decision problem with finite choices in which the
objective is to satisfy a given set of constraints by
finding values for a set of variables.


Content Generator 

A machine learning program that generates con- 
tent by using natural language processing tech- 
niques. 


Content Moderation 

The practice of monitoring and applying pre- 
defined rules and guidelines to the users' submis- 
sions to determine the content's appropriateness. 


Content-Based Filtering 

A technique used in recommendation systems to 
make predictions about a user's interests based 
on the item's features and user's likes. 


Continuous Feature 
A feature that has an infinite number of possible 
values.


Continuous Learning (or Continual Learning)
See "Online Machine Learning." 


Continuous Learning System (CLS) 
See "Online Machine Learning." 


Control Theory 

A field of applied mathematics that deals with 
dynamical systems’ behavior by using feedback 
to influence the behavior in order to achieve the 
desired goal. 

Controllability 

The property of an AI system in which external 
entities can intervene in its functioning. 


Convenience Sampling 
A method of conducting quick experiments with a 
dataset that was not collected scientifically. 


Convergence 

A state reached by a model when additional train- 
ing on the existing data will not significantly im- 
prove the model. 

Conversational Artificial Intelligence 

The technologies that use machine learning and 
natural language processing to provide human-like 
interaction with users via text or voice. 


Conversational User Interface 

A user interface that allows users to interact with 
computers in a way similar to a human-to-human 
conversation. 


Convex Function 
A function in which the area above its graph is a 
convex set. 


Convex Optimization 
A process of finding the minimum of a convex
function using mathematical techniques. 


Convex Set 

A subset of Euclidean space in which a drawn line 
between any two points in the subset stays totally 
within the subset. 


Convolution 
A process of filtering to train weights by mixing the 
convolutional filter and the input matrix.


Convolutional Filter

A filter that performs convolution operations by
scanning the input while considering its
dimensions. Also called "Kernel" or "Feature Detector."


Convolutional Layer (or Convolution Layer)

A layer in a convolutional neural network that uses 
filters to perform convolution operations. 


Convolutional Neural Network (CNN) 
A type of deep neural network used to create models
for analyzing and processing images and videos. 


Convolutional Operation 

A mathematical operation of multiplying a slice of
an input matrix element-wise by the convolutional
filter and then summing all the values in the
resulting matrix.
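
A minimal sketch of this operation: `convolve_slice` multiplies one slice of the input by the filter element by element and sums the products, and `convolve2d` slides the filter over the whole input with a given stride. Both function names and the sample matrices are hypothetical. As is usual in convolutional neural networks, the filter is not flipped, so strictly speaking this is a cross-correlation.

```python
def convolve_slice(matrix_slice, kernel):
    """Multiply a slice and a filter element-wise, then sum the products."""
    return sum(m * k
               for row_m, row_k in zip(matrix_slice, kernel)
               for m, k in zip(row_m, row_k))


def convolve2d(matrix, kernel, stride=1):
    """Apply the filter at every position it fully fits, moving by `stride`."""
    k_rows, k_cols = len(kernel), len(kernel[0])
    output = []
    for r in range(0, len(matrix) - k_rows + 1, stride):
        output_row = []
        for c in range(0, len(matrix[0]) - k_cols + 1, stride):
            window = [row[c:c + k_cols] for row in matrix[r:r + k_rows]]
            output_row.append(convolve_slice(window, kernel))
        output.append(output_row)
    return output


image = [[1, 2, 3],
         [4, 5, 6],
         [7, 8, 9]]
kernel = [[1, 0],
          [0, -1]]
# Top-left slice: 1*1 + 2*0 + 4*0 + 5*(-1) = -4
print(convolve2d(image, kernel))  # [[-4, -4], [-4, -4]]
```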


Convolutional Stride 
The step size of shifting the convolution filters. 


Correlation 
A statistical measure of the extent to which two 
variables are mathematically related. 


Cost 
See "Loss." 


Cost Function 
See "Loss Function." 


Counterfactual Fairness 

A fairness metric that evaluates whether a
classifier produces the same result for individuals
who are identical except for one or more sensitive
attributes.


Covariance

A measure of the joint variability of two random
variables.

Coverage Bias 

A type of selection bias in which the selected data 
does not represent the target population. 


Cross-Entropy 

A way to quantify the difference between two
probability distributions by generalizing the Log 
Loss to multi-class classification problems. 
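
A minimal sketch of the usual formula, H(p, q) = -sum of p_i * log(q_i), where p is the true distribution and q is the predicted one. The function name `cross_entropy`, the small `eps` guard against log(0), and the sample distributions are assumptions made for illustration.

```python
import math


def cross_entropy(true_dist, predicted_dist, eps=1e-12):
    """Return the cross-entropy (in nats) between two distributions."""
    return -sum(p * math.log(max(q, eps))
                for p, q in zip(true_dist, predicted_dist))


# The true class is the second of three classes (one-hot encoded).
one_hot = [0.0, 1.0, 0.0]
confident = [0.05, 0.9, 0.05]
unsure = [0.3, 0.4, 0.3]
print(cross_entropy(one_hot, confident) < cross_entropy(one_hot, unsure))  # True
```

With a one-hot true distribution the sum reduces to the negative log of the probability assigned to the correct class, which is why the confident, correct prediction scores lower.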


Cross-Validation

A mechanism for evaluating a model generaliza- 
tion on new data by testing the model against one 
or more subsets taken from the training set. 
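
One common form is k-fold cross-validation, sketched below under the assumption that the data can be split in its existing order: the sample indices are divided into k folds, and each fold is held out once for testing while the rest is used for training. The function name `k_fold_indices` is hypothetical.

```python
def k_fold_indices(n_samples, k):
    """Return a list of (training_indices, held_out_indices) pairs, one per fold."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and the number of samples")
    indices = list(range(n_samples))
    # Spread any remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0)
                  for i in range(k)]
    folds = []
    start = 0
    for size in fold_sizes:
        held_out = indices[start:start + size]
        training = indices[:start] + indices[start + size:]
        folds.append((training, held_out))
        start += size
    return folds


for training, held_out in k_fold_indices(6, 3):
    print(training, held_out)
# [2, 3, 4, 5] [0, 1]
# [0, 1, 4, 5] [2, 3]
# [0, 1, 2, 3] [4, 5]
```

In practice the indices are usually shuffled first so that each fold is representative; the sketch omits shuffling to keep the output deterministic.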


Custom Estimator

An estimator that is designed to follow specific 
directions. 

Custom Model 

A model that accepts inputs specific to a user and
returns predictions based on those inputs.


Custom Training 
The process that trains a model to make particular 
predictions.


Dark Data

The data that an organization gathers, processes, 
and stores during business operations but is not 
being used for other purposes such as analytics 
or driving revenue. 

Dashboard 

A graphical user interface that combines
information and visualizations to view metrics,
benchmarks, or indicators that assist in monitoring and
decision-making. 

Data 

The representation of information in a suitable 
format for storage, processing, or transmission. 


Data Aggregation 

The process of data gathering from several sourc- 
es for analysis or reporting, including data group- 
ing or summarization, such as sum, average, min- 
imum, or maximum. 


Data Analysis 
A systematic examination of data through samples,
measurement, and visualization. 


Data Analytics 

A concept that consists of collecting, validating,
processing, and visualizing data to obtain insights 
useful for decision-making. 


Data Annotation 

The process of adding descriptive information to the 
dataset to be used as an input for a machine learning 
model. Also called "Data Labeling" or "Labeling."


Data Augmentation

A machine learning technique to create new data 
by manipulating the original data.


Data Center

A physical facility that houses many servers and
data storage devices with high-speed connectivity
to manage the organization's applications and data.


Data Cleansing

A multistage process of reviewing and correcting 
data to ensure it is in a standardized format, in- 
cluding handling missing values, smoothing noisy 
data, and resolving inconsistencies and duplicates.


Data Collection

The process of gathering and measuring data, 
including data acquisition, data labeling, and data 
improvement. 

Data Custodian 

A person or organization in charge of providing IT 
infrastructure and protecting data in conformance 
to data governance policies and practices. 


Data Exhaust 

Any information produced as a byproduct of digital 
activities. 

Data Feed 

A way of providing users with updated data from 
data sources. 


Data Fusion 

The process of combining data from different 
sources to provide more consistent and accurate 
information. 

Data Governance 

The policies, processes, and organizational struc- 
tures that define the decision rights and account- 
abilities to support data management. 

Data Instance 

A selection of a data item from a datatype or a 
dataset. 

Data Integration 

The process of combining data residing in different 
sources into a meaningful unified view to users. 


Data Integrity 
The trust measure for data accuracy, completeness, 
consistency, and compliance with regulations. 


Data Labeling 
See "Data Annotation." 

Data Lake 
A repository for storing a large amount of raw data 
in different formats. 


Data Marketplace 

An electronic platform that primarily provides data 
or any related services and allows the buying and 
selling of machine-readable data. 


Data Mart 
A data warehouse subset that focuses on a spe- 
cific line of business or subject area. 


Data Migration 
The process of transferring data from one system 
to another. 


Data Mining 
The process of discovering patterns from a large 
amount of data and extracting useful information. 


Data Model 

A group of conceptual methods for describing 
data, data semantics, data relationships, and con- 
sistency constraints. 


Data Modelling 

The process of building a conceptual representa- 
tion of data and their relationships to be stored in 
a database. 


Data Owner 
An individual or group accountable for 
particular data. 


Data Parallelism 

A technique used to speed up training or inference 
by duplicating the model on multiple devices then 
splitting up the input data over these devices. 


Data Point 
A discrete piece of information derived from 
a larger dataset. 


Data Processing 
A wide range of operations that can be carried out 
on data by manual or automated means. 


Data Profiling 
The process of examining the data and creating 
informative summaries about it. 


Data Quality 
A measure of data usefulness and fit for purpose, 
including completeness, consistency, and accuracy. 

Data Replication 

The duplication of data on several computers with- 
in a distributed system to ensure the consistency 
level for accessing information. 


Data Representation 
The form that represents how to store, process, 
and transmit data. 


Data Sampling 

A process of selecting a representative subset 
of data samples from a larger dataset to analyze 
patterns and trends. 

Data Science 

A multidisciplinary field that focuses on extracting 
meaningful information and insights from data through 
a process of discovery or hypothesis testing. 


Data Scientist 

An analytical data specialist who has technical 
skills to solve complicated problems using data 
science tools and techniques. 

Data Source 

A repository of datasets, metadata sets, databases, 
or metadata from which data or metadata is available 
for analysis. 


Data Steward 

A data management role that includes implementing 
and maintaining data governance policies within 
an organization. 

Data Structure 

A way to organize and store data in a computer's 
memory for efficient access and processing. 


Data Subject 

A person who is identifiable from recorded 
personal data. 

Data Variability 

The changes of datasets in terms of the rate of 
transmission, format, semantics, or quality. 


Data Variety 
The diversity of datatypes and formats such as struc- 
tured, semi-structured, and unstructured data. 


Data Velocity 
The flow rate of creating, transmitting, storing, 
analyzing, or visualizing data. 

Data Veracity 
The accuracy and completeness of data. 


Data Visualization 

A graphical representation of information that 
highlights patterns and trends in data and aids 
the reader in gaining quick insights. 


Data Volatility 
The characteristics of data related to its rate of 
change over time. 


Data Volume 
The amount of data that impacts computation and 
storage resources as well as their management. 


Data Warehouse 
A repository of data gathered from multiple sourc- 
es into a central and consistent data store to sup- 
port data analysis. 


Data-Driven Decision Making (DDDM) 

The process of making decisions based on data 
rather than just intuition or observation. Also called 
"Data-Informed Decision-Making (DIDM)." 


Data-Informed Decision-Making (DIDM) 
See "Data-Driven Decision Making (DDDM)." 


Database 
A collection of data organized based on a con- 
ceptual structure to support an application area. 


Database Administrator (DBA) 
A person who manages and maintains databases 
and their systems. 


Database as a Service (DBaaS) 
The platform that provides database functionality 
on the cloud. 


Database Normalization 

A process of designing data in a database by es- 
tablishing relationships between tables to remove 
redundancy and inconsistent dependency. 


Database Schema 
The overall design of a database. 


Dataframe 
A popular datatype used to represent datasets. 


Datalog 

A declarative logic programming language that 
includes a finite set of facts and rules and can be 
used in deductive databases. 

Dataset (or Data Set) 
A collection of data that can be accessed in one 
or more formats. 


Datatype (or Data Type) 
A specified set of data objects that have a defined 
data structure and a set of allowed operations. 


Decentralized Artificial Intelligence 
See “Distributed Artificial Intelligence (DAI).” 


Decision Analysis 
A normative approach for making optimal choices 
under uncertain conditions. 


Decision Boundary 

The separator between the classes that a mod- 
el learned in binary or multi-class classification 
problems. 

Decision Support System (DSS) 

An information system that assists in evaluating al- 
ternatives, solving problems, and making decisions. 


Decision Theory 
A study of the reasoning behind choices and de- 
cisions under uncertainty. 


Decision Threshold 
See "Classification Threshold." 


Decision Tree 
A supervised learning algorithm that uses a tree- 
like graph to perform decision analysis. 


Decision Tree Learning 
The process of building a decision tree from data 
to perform classification or regression tasks. 


Decoder 
A machine learning system that transforms pro- 
cessed representation into raw representation. 


Deconvolution 
See "Transposed Convolution." 


Deconvolutional Neural Network (DNN) 
A convolutional neural network (CNN) that works 
inversely. 


Deduction 
See "Deductive Reasoning." 


Deductive Classifier 
An adaptive type of inference engine used to 
enhance the classification accuracy. 

Deductive Reasoning 


The process of inferring conclusions based on 
generally accepted statements or facts. Also 
called "Deduction." 


Deep Belief Network 
A type of deep learning network composed of 
stacking several Restricted Boltzmann Machines. 


Deep Blue 

A chess-playing supercomputer that defeated the 
world chess champion, Garry Kasparov, in 1997. 


Deep Learning 

A subfield of machine learning that uses multiple 
neural network hidden layers to solve complex 
problems by identifying the most essential fea- 
tures of the input data. 


Deep Model 
See "Deep Neural Network (DNN)." 


Deep Neural Network (DNN) 

A type of artificial neural network (ANN) contain- 
ing multiple layers between the input and output 
layers. Also called "Deep Model." 


Deep Q-Network (DQN) 

An algorithm that combines reinforcement learning 
and deep neural networks to solve complex prob- 
lems and deal with high-dimensional environments. 


Deepfake 
A type of synthetic media in which a person in an 
image, video, or audio recording is replaced with 
someone else. 


Demographic Data 
The socioeconomic data about a population such as 
gender, age, education, income, and employment. 


Demographic Parity 

A fairness metric that is satisfied if the classifica- 
tion results of a model are independent of a given 
sensitive attribute. 


Denoising 
An approach in which noise is artificially added to 
the dataset and the model then attempts to remove it. 


Dense Feature 
A feature in which most values are non-zero. 

Dense Layer 
See "Fully Connected Layer." 


Depth 
The number of layers that learn weights in a neural 
network. 


Depthwise Separable Convolutional Neural 
Network (sepCNN) 

A convolutional neural network architecture where 
depthwise separable convolution replaces Incep- 
tion modules. 


Dialog Management 
A process of selecting the appropriate next move 
in a dialogue to achieve the intended goal. 


Dialogue System 
A computer system that can converse with a hu- 
man in natural language. 


Differential Privacy 

A mathematical definition of privacy in machine 
learning that allows collecting, analyzing, and 
sharing statistics about a dataset based on personal 
data while protecting individuals' privacy. 

Digital Image Processing 

See "Image Processing." 


Digital Signal Processing (DSP) 
A subfield of signal processing that focuses on using 
computers to analyze and manipulate digital signals. 


Dimension 
An entry in a feature vector, a level of coordinate 
in a tensor, or an element in an embedding layer. 


Dimension Reduction (or Dimensionality Reduction) 
A method of reducing the number of variables in a 
feature vector to lower the complexity of a model. 


Discrete Feature 
A feature that has a finite number of possible values. 


Discrete Wasserstein Distance 
See "Earth Mover's Distance (EMD)." 


Discriminative Model 

A model used in classification and regression to 
predict labels by defining the conditional proba- 
bility of the output given the features and weights. 
Also called the "Conditional Model." 

Discriminator 

A part of the generative adversarial network that 
examines whether an example created by the 
generator is real or fake. 


Disparate Impact 

A situation in which an algorithmic decision-mak- 
ing process benefits or harms specific subgroups 
more than others. 


Disparate Treatment 

A situation in which sensitive attributes of subjects 
are factored into an algorithmic decision-making 
process such that different subgroups are not 
treated equally. 


Distributed Artificial Intelligence (DAI) 

A subfield of AI that focuses on solving problems 
using coordinated and concurrent operations. Also 
called "Decentralized Artificial Intelligence." 


Distributed Computing 

A distributed system that includes multiple soft- 
ware components running on multiple computers 
as a single system. 


Distributed Data Processing (DDP) 
A form of data processing where the operations 
are dispersed among a network of computers. 


Distributed File System (DFS) 

A file management system that allows files to be 
stored across a collection of devices while giving 
a single-file-system view to clients. 


Divisive Clustering 

A type of hierarchical clustering in which all exam- 
ples are first grouped into a cluster, and then the 
cluster is iteratively divided into a hierarchical tree. 


Domain Adaptation 

A subfield of machine learning that aims to apply 
an algorithm trained in a source domain to another 
related target domain. 


Downsampling 
See "Undersampling." 


Driver Assistance 

A system that is designed to support the driver in 
the driving tasks and enhance safety and comfort. 
Also called "Advanced Driver Assistance Systems 
(ADAS)." 

Driverless Car 
See "Autonomous Vehicle." 


Dropout Regularization 
The process of removing a random selection of 
units in a network layer for a single gradient step. 
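
A minimal sketch of dropping units for one step, assuming the common
"inverted dropout" convention in which the surviving activations are
rescaled by 1 / (1 - rate). That rescaling, the function name, and the
parameters are assumptions for illustration, not part of the definition.

```python
import random


def dropout(activations, rate, rng):
    """Zero each unit with probability `rate` (0 <= rate < 1).

    Surviving units are scaled by 1 / (1 - rate) so that the expected
    value of each activation is unchanged.
    """
    keep = 1.0 - rate
    return [a / keep if rng.random() < keep else 0.0 for a in activations]


# A fresh random mask would be drawn for every gradient step.
masked = dropout([1.0] * 1000, rate=0.5, rng=random.Random(0))
```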


Dynamic Model 
A model that is trained online with data 
continuously entering the model. 

Eager Learning 


A learning method that processes the training data 
before receiving a query. 


Early Stopping 

A regularization method that stops model training 
before the training loss has finished decreasing. 
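
One common way to decide when to stop is to watch the loss on a
validation set and halt once it has not improved for a few epochs. The
sketch below assumes that variant; the `patience` parameter and the
list of per-epoch validation losses are hypothetical inputs.

```python
def early_stopping_epoch(val_losses, patience=2):
    """Return the index of the epoch at which training would stop.

    Training stops once the validation loss has failed to improve for
    `patience` consecutive epochs; otherwise it runs to the last epoch.
    """
    best = float("inf")
    epochs_without_improvement = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best:
            best = loss
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                return epoch
    return len(val_losses) - 1
```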


Earth Mover's Distance (EMD) 

A measure of the distance between two probability 
distributions. Also called "Discrete Wasserstein 
Distance." 
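
For two histograms over the same evenly spaced one-dimensional bins
with equal total mass, the distance reduces to the sum of the absolute
cumulative differences. The sketch below covers only that special
case; the function name is illustrative.

```python
def emd_1d(p, q):
    """Earth mover's distance between two 1-D histograms.

    Assumes both histograms share the same unit-spaced bins and have
    the same total mass.
    """
    total = 0.0
    carried = 0.0  # mass that still has to move to later bins
    for p_i, q_i in zip(p, q):
        carried += p_i - q_i
        total += abs(carried)
    return total
```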


Echo State Network (ESN) 

A type of reservoir computing that creates a ran- 
domly connected recurrent network and trains 
suitable output connection weights to produce 
the desired output. 


Edge Computing 

A computing approach in which the computation is 
executed at the network edge, such as on IoT sensor 
devices or local appliances, instead of in data centers. 


Eliza Effect 

The human tendency to presume that machines 
behave similarly to humans by associating terms 
from prior experience. 


Embedding 
The process of converting a high-dimensional 
vector into a low-dimensional space. 


Embedding Space 

The relatively low-dimensional vector space to 
which the features from a higher-dimensional vec- 
tor space are mapped. 

Embodied Agent 


An agent that interacts with an environment via a 
physical or virtual body. 

Embodied Artificial Intelligence 

A field that focuses on incorporating AI capabilities 
into robots and enables them to learn by 
interacting with the surrounding environment. 


Embodied Cognitive Science 

A field that aims to study the mechanisms of de- 
veloping cognitive processes and emphasize the 
role of the interaction with the environment. 


Emotion Artificial Intelligence 
(or Emotional Artificial Intelligence) 
See "Affective Computing." 


Emotion Recognition 
The process of identifying emotions expressed in 
text, speech, or image. 


Empirical Risk Minimization (ERM) 
A learning principle to select the model that mini- 
mizes loss on the training set. 


Encoder 
A machine learning system that transforms raw 
representation into processed representation. 


Ensemble 
A method of combining the predictions of multiple 
models to obtain better predictive performance. 


Ensemble Averaging 

A technique used in artificial neural networks to 
produce a single averaged model from many mod- 
erately accurate models. 


Entity 
A thing (physical or non-physical) that can be dis- 
tinctly identifiable. 


Entity Annotation 

The process of identifying, extracting, and tagging 
entities in the text to help the machine understand 
unstructured sentences. 


Environment 
A world in reinforcement learning that contains the 
agent and allows it to observe the state of that world. 


Epoch 
A full training pass over the whole dataset. 


Epsilon Greedy Policy 
A policy in reinforcement learning that follows a random 
policy with probability epsilon and a greedy policy otherwise. 
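
A minimal sketch of the selection rule, assuming the agent keeps a list
of estimated action values (named `q_values` here purely for
illustration):

```python
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """Return an action index.

    With probability `epsilon` the action is chosen uniformly at random;
    otherwise the action with the highest estimated value is chosen.
    """
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])
```

Setting epsilon to 0 gives a purely greedy policy, and setting it to 1
gives a purely random one.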

Equality of Opportunity 
A fairness metric to examine if a classifier predicts a 
preferred label equally for all the attribute's values. 


Equalized Odds 
A fairness metric to examine if a classifier predicts 
a label equally for all the attribute's values. 


Error-Driven Learning 

A type of learning in which an agent uses infor- 
mation about the difference between current and 
target behavior to modify its behavior. 


Estimator 
An equation in machine learning used to select the 
best data model based on real observations. 


Evaluation Data 
See "Test Dataset (or Test Data, Test Set)." 


Evolutionary Algorithm (EA) 
An algorithm inspired by natural evolution to provide 
solutions for search and optimization problems. 


Evolutionary Computation 
See "Evolutionary Algorithm (EA)." 


Example 
A single instance in a dataset that consists of one 
or more features and sometimes a label. 


Experience Replay 

A memory buffer in reinforcement learning to store 
state transitions acquired during the learning pro- 
cess and later use them for training the model. 
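
A minimal sketch of such a buffer, assuming a fixed capacity and
uniform random sampling. The class name and the tuple layout
(state, action, reward, next_state) are illustrative assumptions.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-size store of state transitions."""

    def __init__(self, capacity):
        # The oldest transitions are discarded once capacity is reached.
        self.buffer = deque(maxlen=capacity)

    def add(self, state, action, reward, next_state):
        self.buffer.append((state, action, reward, next_state))

    def sample(self, batch_size, rng=random):
        # Uniform sampling without replacement from the stored transitions.
        return rng.sample(list(self.buffer), batch_size)
```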


Experimenter's Bias 

A type of confirmation bias where the experimenter 
keeps training a model until confirming a 
pre-existing hypothesis. 


Expert System 
A computer system that mimics the ability of a 
human expert to make decisions in a particular 
field based on a knowledge base that includes 
facts and rules. 


Explainability 

The ability to explain in understandable terms to 
humans the important factors that influence the AI 
system results. Also called "Explainable Artificial 
Intelligence (XAI)" or "Interpretability." 


Explainable Artificial Intelligence (XAI) 
See "Explainability." 

Exploding Gradient Problem 

An issue found in training neural networks when large 
error gradients accumulate, leading to an unstable 
model due to very large updates to the weights. 


Exponential Distribution 
A probability distribution that measures the time 
between events. 


Extended Reality 

A combination of AI with virtual, augmented, and 
mixed reality technology to add advanced 
capabilities. 

External Data 

The data stored in external sources. 


Extract, Transform and Load (ETL) 

The process of collecting data from different 
sources, converting it into an appropriate format, 
and storing it in a centralized repository. 


Extrapolation 
The process of predicting the value of variates 
outside the observations’ range. 


F 


F-Score (or F1-Score) 

A measure of a model's accuracy in binary 
classification, calculated as the harmonic mean of 
precision and recall. 
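
A minimal sketch of the calculation from confusion-matrix counts. The
function name and the convention of returning 0 when a denominator is
zero are assumptions for illustration.

```python
def f1_score(tp, fp, fn):
    """Harmonic mean of precision and recall.

    tp, fp, fn: counts of true positives, false positives,
    and false negatives.
    """
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```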


Face Recognition (or Facial Recognition) 
A computer system capable of identifying a person 
from an image or a video frame. 


Fairness Constraint 
A constraint applied to an algorithm to make sure 
that one or more fairness definitions are met. 


Fairness Metric 
A measurable definition of fairness. 


Fall-Out 
See "False Positive Rate (FPR)." 


False Negative (FN) 
A result that the model incorrectly predicted as 
negative while it is actually positive. 


False Negative Rate (FNR) 
The proportion of actual positive examples which 
are predicted as negatives. Also called "Miss Rate." 

False Positive (FP) 
A result that the model incorrectly predicted as 
positive while it is actually negative. 


False Positive Rate (FPR) 
The proportion of actual negative examples which 
are predicted as positives. Also called "Fall-Out." 
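
The two rates can be sketched directly from confusion-matrix counts.
The function names below are illustrative, and both functions assume
that their denominators are non-zero.

```python
def false_positive_rate(fp, tn):
    """Share of actual negatives predicted as positive: FP / (FP + TN)."""
    return fp / (fp + tn)


def false_negative_rate(fn, tp):
    """Share of actual positives predicted as negative: FN / (FN + TP)."""
    return fn / (fn + tp)
```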


Fast-and-Frugal Tree (FFT) 
A simple decision tree that is used for classification 
problems with limited information. 


Fault Tolerance 
The ability of a system to continue performing the 
required function even with the presence of errors. 


Feature 

A property or characteristic of data that can be 
used as an input in making predictions. Often used 
as a synonym for "Attribute." 


Feature Cross 
A synthetic feature that is created by multiplying 
two or more features. 


Feature Detector 
See "Convolutional Filter." 


Feature Engineering 
The process of selecting and creating useful fea- 
tures from raw data to train a model. 


Feature Extraction 

The process of generating or deriving informative 
and non-redundant variables from raw data to 
describe the properties of the objects or entities. 


Feature Learning 

A collection of techniques for automatically finding 
the representations needed for feature detection 
or classification from raw data. 


Feature Map 

The output of applying a convolution operation to 
input data by using a filter. Also called "Activation 
Map." 


Feature Normalization 
A way to change a feature's range of values to a 
standard range of values, usually -1 to +1 or 0 to 1. 
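
A minimal sketch using min-max rescaling, which is one common way to
map a feature onto a standard range. The function name, the default
range of 0 to 1, and the handling of constant features are assumptions.

```python
def min_max_normalize(values, low=0.0, high=1.0):
    """Linearly rescale `values` so they span the range [low, high]."""
    smallest, largest = min(values), max(values)
    if largest == smallest:
        # A constant feature carries no range; map every value to `low`.
        return [low for _ in values]
    scale = (high - low) / (largest - smallest)
    return [low + (v - smallest) * scale for v in values]
```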


Feature Scaling 
A way to change a feature's range of values to 
match the range of other features in a dataset. 

Feature Selection 
A process of reducing the dimensions that the mod- 
els work with by dismissing irrelevant features. 


Feature Set 
A collection of features on which a machine learn- 
ing model trains. 


Feature Space 
A set of features relevant to certain properties of 
an object or event under study. 


Feature Vector 
A set of feature values that represent an example 
passed into a model. 


Federated Learning 

A distributed machine learning approach that trains 
a model on multiple devices with local data to pre- 
serve data privacy. 


Feedback Loop 
A situation in machine learning where a model's 
predictions impact the training data. 


Feedforward Neural Network 

An artificial neural network in which the information 
always travels in one direction, from the input layer 
to the output layer. 


Few-Shot Learning 
An approach to learn effective classifiers from a 
small number of training examples. 


Fine Tuning 
The process of adjusting the parameters of a trained 
model to meet the requirements of a new problem. 


Flow-Based Chatbot 

A rule-based chatbot that has a predefined se- 
quence of conversations. Also called "Rule-Based 
Chatbot" or "Keyword-Based Chatbot." 


Forget Gate 
The part of a Long Short-Term Memory cell that 
regulates how information flows through the cell. 


Formal Language 
A set of words whose letters are drawn from an 
alphabet and formed according to a set of rules. 


Forward Chaining 

A logical reasoning approach that starts with 
known facts and applies rules iteratively to get 
new facts and consequences. 

Forward Propagation 

The process of calculating and storing interme- 
diate variables in a neural network from the input 
layer to the output layer. 


Fractionally Strided Convolution 
See "Transposed Convolution." 


Frame 
A data structure that represents stereotyped situa- 
tions to divide knowledge into substructures. 


Frame Language 
A language that uses frames for knowledge rep- 
resentation. 


Frame Problem 
A problem of specifying what remains unchanged 
as a result of actions or events. 


Friendly Artificial Intelligence (FAI) 

A hypothetical Artificial General Intelligence (AGI) 
that supports human values and has a positive 
effect on humanity. 


Fully Connected Layer 

A hidden layer in an artificial neural network where 
each node is connected to every node in the next 
hidden layer. Also called "Dense Layer." 


Fuzzy Logic 

A type of logic system that allows a wider expression 
of logical statements by providing a multi-valued 
distinction between true and false. 


Fuzzy Rule 
A rule in fuzzy logic that is written in the form of an 
if-then statement. 

Game Theory 
A branch of mathematical modeling that studies 
the conflict and cooperation among rational 
decision-makers. 


Gaussian Distribution 
See "Normal Distribution." 


General Artificial Intelligence 
See “Artificial General Intelligence (AGI).” 


Generalization 
The ability of a model to provide correct predic- 
tions based on new data. 

Generalization Curve 
A plot of the loss curves for both the training set and 
the validation set that helps in detecting overfitting. 


Generalized Linear Model (GLM) 

An expansion of linear regression that allows different 
output distribution functions to specify the variance 
between observations and the predicted values. 


Generative Adversarial Network (GAN) 

An unsupervised machine learning algorithm composed 
of two neural networks, a generative network and a 
discriminative one, that compete with each other 
in a zero-sum game framework. 


Generative Model 
A machine learning model that can create new 
examples similar to the training dataset. 


Generative Pre-Trained Transformer (GPT) 
A family of large language models based on the 
transformer architecture. 


Generative Product Design 


The software that generates drafts with all of the 
defined features and requirements to create 
product designs. 


Generator 
A part of the generative adversarial network that 
creates fake data. 


Genetic Algorithm 

A type of evolutionary algorithm that uses the 
principles of natural selection to find suitable 
solutions for optimization problems. 


Geometric Distribution 

A probability distribution that shows the number of 
trials needed to get the first success in a Bernoulli 
experiment. 


Glowworm Swarm Optimization (GSO) 

A swarm intelligence algorithm that mimics the 
behavior of glowworms (lightning bugs) to locate 
the optima of a multimodal function. 


Gradient 

A vector of partial derivatives of the model function 
that measures the change in error with respect 
to a change in the weights. 


Gradient Boosting 

A technique in machine learning used for classifi- 
cation and regression to create a strong prediction 
model from weak prediction models. 

Gradient Clipping 

A mechanism to reduce the exploding gradient 
problem by restricting the gradients' maximum val- 
ue when training a model using gradient descent. 
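
A minimal sketch of clipping by value, where each gradient component
is limited to a symmetric range. The function name and the `limit`
parameter are illustrative, and clipping by the overall gradient norm
is another common variant not shown here.

```python
def clip_by_value(gradient, limit):
    """Restrict every gradient component to the range [-limit, limit]."""
    return [max(-limit, min(limit, g)) for g in gradient]


clipped = clip_by_value([0.5, -3.0, 7.0], limit=1.0)
```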


Gradient Descent 

An optimization algorithm for finding the minimum 
of a function by iteratively taking steps that are 
proportionate to the negative of the gradient. 
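
A minimal one-dimensional sketch of the update rule, assuming the
gradient is available as a function. The learning rate, step count,
and the example function f(x) = (x - 3)^2 are illustrative choices.

```python
def gradient_descent(grad, x0, learning_rate=0.1, steps=100):
    """Repeatedly step against the gradient, starting from x0."""
    x = x0
    for _ in range(steps):
        x = x - learning_rate * grad(x)
    return x


# f(x) = (x - 3)^2 has gradient 2 * (x - 3) and its minimum at x = 3.
minimum = gradient_descent(lambda x: 2 * (x - 3), x0=0.0)
```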


Graph Database (GDB) 
A database that uses nodes and edges to represent 
data and its relations. 


Graph Neural Network (GNN) 
A type of deep learning method that processes 
data represented as graphs. 


Graph Traversal 
A process of visiting all nodes in a graph. 


Graphical Model 

A class of statistical models that uses graphs to 
represent the probabilistic relationships between 
variables. 


Graphics Processing Unit (GPU) 
An electronic circuit that performs parallel com- 
putation to accelerate the creation of images for 
display on an electronic device. 


Greedy Algorithm 

An algorithm that chooses the best immediate 
option at each step in the hope of reaching the 
overall optimal solution. 


Grid Computing 
A set of networked computers that work as a virtual 
supercomputer to execute large tasks. 


Ground Truth 

The information acquired from direct observation 
rather than inference. 

Group Attribution Bias 

The assumption that what is true for one individual 
is also true for everyone in that individual's group. 


Hadoop 

An open-source framework that provides distributed 
processing of large datasets across clusters of 
computers. 

Hadoop Distributed File System (HDFS) 

An open-source storage engine, based on the 
Google File System (GFS) architecture, designed 
for efficient storage of large files. 


Hashing 

A machine learning mechanism for bucketing cat- 
egorical data when the number of categories is 
large while the actual number of categories in the 
dataset is relatively small. 
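
A minimal sketch of hashing a category name into a fixed number of
buckets. SHA-256 is used here only because it gives the same result on
every run; the function name and the choice of hash function are
assumptions.

```python
import hashlib


def hash_bucket(category, num_buckets):
    """Map a category string to a bucket index in [0, num_buckets)."""
    digest = hashlib.sha256(category.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_buckets
```

Different categories can land in the same bucket (a collision), which
is the trade-off accepted in exchange for a small, fixed number of
buckets.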

Heteronomy (or Heteronomous) 

A characteristic of a system operating under the 
constraint of external control. 

Heuristic 

A technique to provide a quick solution to a problem 
that may be approximate rather than perfect. 


Heuristic Search 
A technique to find a good solution, not neces- 
sarily the optimal one, within a reasonable time. 


Hidden Layer 
A synthetic layer that sits between the input and 
output layers of a neural network. 


Hierarchical Clustering 
A type of clustering algorithm that is suitable for 
hierarchical data to generate a tree of clusters. 


Hinge Loss 

A type of loss function for classification that aims 
to maximize the margin between training examples 
and the boundary by locating the decision 
boundary as far as possible from each example. 
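
A minimal sketch of the standard form, max(0, 1 - y * score), assuming
labels are encoded as -1 or +1 and `score` is the raw model output.
The function name is illustrative.

```python
def hinge_loss(label, score):
    """Hinge loss for one example; `label` must be -1 or +1."""
    return max(0.0, 1.0 - label * score)
```

The loss is zero only when the example is on the correct side of the
boundary with a margin of at least 1.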

Holdout Data 

The dataset that is intentionally not used during 
the training and will be used to evaluate the mod- 
el's generalization ability. 

Horizontal Scaling 

A method of increasing performance, storage 
capacity, and availability by partitioning and 
replicating datasets across clusters of servers. 

Hot Data Storage 

Stored data that can be accessed quickly for 
immediate use. 

Human Action Recognition 

See "Action Recognition." 


Human Augmentation 
A field that focuses on enhancing human abilities by 
using technology as an integral part of the human body. 


Human Takeover 
A chatbot fallback feature that allows the contact 
center agent to take control of the conversation. 


Human-Computer Interaction (HCI) 
The study of the design and implementation of the 
interaction between users and computers. 


Human-Machine Teaming 

The combination of human interaction and ma- 
chine intelligence capabilities. 

Human-Robot Interaction (HRI) 

A field that focuses on exchanging information 
and actions between human and robot to perform 
tasks through a user interface. 

Hyper-Heuristic 

A high-level search methodology that automates 
the process of selecting or combining heuristics 
to solve complex search problems. 


Hyperparameter 


A special parameter to tweak the model training 
and estimate the model parameters. 


Hyperparameters Tuning 

The process of adjusting the hyperparameters 
through trial and error to find which set of 
hyperparameters gives the most accurate model. 


Hyperplane 
A separating boundary between two subspaces 
used to separate the data space for classification. 


Hypothesis 
An idea or a prediction that needs to be investi- 
gated and tested by an experiment. 


Image Classification 
See "Image Recognition." 


Image Processing 

The use of an algorithm to manipulate images 
or extract useful information. Also called "Digital 
Image Processing." 


Image Recognition 

A process that identifies objects, patterns, or con- 
cepts in images. Also called "Image Classification" 
or "Visual Recognition." 


Image Segmentation 
A process of dividing a digital image into sets of 
pixels. 


ImageNet 

A large image database consisting of over 14 mil- 
lion annotated images classified into 20,000 cat- 
egories, prepared for computer vision research. 


Imbalanced Dataset 
See "Class-Imbalanced Dataset." 


Implicit Bias 
A type of bias in which associations or assumptions are
made according to a person's mind and memories.


Imputation 
The process of replacing missing data with mod- 
eled or estimated data. 
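
As an illustration of the definition above, the following minimal sketch fills missing values with the mean of the observed values. Mean imputation is only one of several possible strategies, and the function name `impute_mean` and the use of `None` to mark missing data are assumptions made for this example.

```python
from statistics import mean


def impute_mean(values):
    """Replace None entries with the mean of the observed entries."""
    observed = [v for v in values if v is not None]
    if not observed:
        raise ValueError("cannot impute: no observed values")
    fill = mean(observed)
    return [fill if v is None else v for v in values]


ages = [20, None, 30, 40, None]
print(impute_mean(ages))
```

Model-based imputation would replace `fill` with a per-example prediction instead of a single constant.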


In-Database Analytics 

An analysis model where data processing is inte- 
grated into the database. 

In-Group Bias 

A type of bias in which favoritism is given to a
person's own group or characteristics.


In-Memory Database (IMDB) 
A database that stores data in the main memory 
instead of a disk drive to accelerate response times. 


Incompatibility of Fairness Metrics 

The idea that some fairness concepts cannot be
satisfied simultaneously, and therefore there is no
single metric to quantify fairness in all machine
learning problems.


Incremental Learning 
A machine learning approach in which training and
learning are performed continuously whenever 
new examples appear. 


Independent and Identically Distributed (IID)
The random variables that have the same proba- 
bility distribution and are mutually independent. 


Individual Fairness 
A fairness metric that examines whether similar 
individuals are classified in a similar way.


Induction
See "Inductive Reasoning." 


Inductive Programming 

A field concerned with designing algorithms for 
computer programs with complex control flow 
from incomplete specifications. 


Inductive Reasoning 
The process of inferring general conclusions based 
on particular instances. Also called "Induction." 


Industrial Robot 
An automated and programmable robot that is 
used in industrial applications. 


Industrial Robot Cell 
An industrial robot system(s) that include(s) ma- 
chines and protected areas. 


Industrial Robot Line 

A set of industrial robot cells that perform some 
functions with associated equipment in protected 
spaces. 


Inference 
A process for making predictions by applying a 
trained model to unlabeled examples. 


Inference Engine 

A component of an expert system that performs 
reasoning to conclude new information from facts 
and rules stored in a knowledge base. 


Information 
Processed data that has meaning in a certain
context.


Information Extraction 

A process of obtaining knowledge by going 
through a text to find specific object classes and 
their relationships. 


Information Retrieval 

An area of computer science concerned with 
searching for relevant information or documents 
from different sources. 


Information Theory 
The study of operations on digital information such 
as transmission, storage, and compression. 


Input Data (or Input) 

Any form of data entered into an information pro- 
cessing system or any of its components for the 
purpose of storage or processing.


Input Layer
The first layer in a neural network that receives 
the input data. 


Instance 
See "Example." 


Intelligence Amplification (IA) 
The use of technology to augment human intelligence. 


Intelligence Augmentation 
See "Augmented Intelligence." 


Intelligence Explosion 

A hypothesis that describes a state when superintelligent
machines can produce even better machines
surpassing human intelligence. Also called
"Technological Singularity" or "Singularity."


Intelligent Agent (IA) 
An autonomous entity that perceives its environ- 
ment and takes actions to achieve specific goals. 


Intelligent Automation 
The use of AI and other technologies to perform
tasks and processes without human intervention. 


Intelligent Control 

A type of control mechanism that utilizes different
artificial intelligence techniques to emulate
characteristics of human intelligence.


Intelligent Personal Assistant 
A software agent that uses AI to assist people with
basic tasks. 


Intelligent Product 
A product with intelligent capabilities that contin- 
uously evolves to meet customers’ needs. 


Intelligent Robot 

A robot that can carry out tasks by sensing its
surroundings, interacting with external sources, and
adapting its behavior accordingly.


Inter-Annotator Agreement 
See "Inter-Rater Agreement."


Inter-Rater Agreement 

A metric to calculate how often human raters 
agree when doing a task. Also called "Inter-An- 
notator Agreement" or "Inter-Rater Reliability." 


Inter-Rater Reliability 
See "Inter-Rater Agreement."


Internet of Things (IoT)
A network of electronic devices, software, and sen- 
sors that allow machines to interact with each other. 


Internet of Things (IoT) Device
A sensor or an actuator in an IoT system that
interacts with the physical world.


Interpretability 
See "Explainability." 


Intersection over Union (IoU)

A metric to measure the accuracy of object or 
image detection tasks by comparing the predicted 
bounding box versus the actual one. 
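
The comparison described above can be sketched as follows. This is a minimal example that assumes axis-aligned boxes given as `(x1, y1, x2, y2)` corner coordinates; the box format and the function name `iou` are assumptions for illustration.

```python
def iou(box_a, box_b):
    """Intersection over Union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlapping region (zero if the boxes are disjoint).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    intersection = inter_w * inter_h

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - intersection

    return intersection / union if union > 0 else 0.0


predicted = (0, 0, 2, 2)
actual = (1, 1, 3, 3)
print(iou(predicted, actual))
```

A value of 1 means the predicted and actual boxes coincide; a value of 0 means they do not overlap.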


Intrinsic Motivation 

A mechanism that encourages an agent to per- 
form a behavior for its inherent satisfaction without 
direct feedback from the environment. 


Item Matrix 

A matrix of embeddings in recommendation sys- 
tems generated by matrix factorization that con- 
tains latent signals about each item. 


Iteration 
A single update to the weights of a model during
training.


K-Fold Cross-Validation

A cross-validation technique that randomly splits 
a given dataset into K equal subsets (folds) used
to train and evaluate a machine learning model. 
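
The splitting step can be sketched as below. The function name `k_fold_indices` and the fixed `seed` are assumptions for illustration; when the dataset size is not divisible by K, the folds in this sketch differ in size by at most one example.

```python
import random


def k_fold_indices(n_samples, k, seed=0):
    """Yield (train_indices, test_indices) pairs for K-fold cross-validation."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)

    # Deal the shuffled indices into k folds, round-robin.
    folds = [indices[i::k] for i in range(k)]

    for i in range(k):
        test = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


for train, test in k_fold_indices(n_samples=10, k=5):
    print(len(train), len(test))
```

Each example appears in exactly one test fold, so every example is used once for evaluation and K-1 times for training.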


K-Means (or K-Means Clustering) 
A clustering algorithm that uses the mean to group
data into k clusters based on feature similarity.


K-Medians (or K-Medians Clustering) 
A clustering algorithm in which the median deter- 
mines the clusters’ centroids. 


K-Modes (or K-Modes Clustering) 
A clustering algorithm for categorical data by us- 
ing the mode to define the clusters’ centroids. 


Keras 
A popular Python API for machine learning that 
runs on many deep learning frameworks. 


Kernel 
See "Convolutional Filter."


Kernel Function
A similarity function between two instances to map 
the data into a high-dimensional vector space. 


Kernel Support Vector Machine (KSVM) 

A classification algorithm that transforms input data 
vectors to a higher dimensional space to increase 
the margin between positive and negative classes. 


Kernel Trick 
A method to project non-linear data into a high- 
er-dimensional space to easily classify the data. 


Keypoint 
The position of interesting features in an image. 
Also called "Landmark." 


Keyword-Based Chatbot 
See "Flow-Based Chatbot." 


Knowledge Base 
A database that includes inference rules and infor- 
mation about human experience in a field. 


Knowledge Engineering 

A discipline that focuses on obtaining knowledge 
from domain experts and other sources and stor- 
ing it into a knowledge base. 


Knowledge Graph 

A network that represents real-world entities and 
their relationships. Also called "Semantic Net- 
work." 


Knowledge Representation 

A field of AI that focuses on representing real-world
information in a form that the computer
can use to perform complex tasks.


Knowledge Representation and Reasoning (KRR) 


A subfield of AI concerned with encoding human 
knowledge in computer systems to draw logical 
conclusions. 


Knowledge-Based System (KBS)

A computer system that uses a knowledge base
to capture the knowledge of human experts and
an inference engine to solve problems.


L1 Loss

A loss function used to calculate the absolute value
of the difference between the predicted values
and the actual values.


L1 Regularization

A type of regularization that adds a penalty to 
weights equivalent to the sum of the weights' 
absolute values. 

L2 Loss 

See "Squared Loss." 


L2 Regularization 

A type of regularization that adds a penalty to 
weights equivalent to the sum of the weights' 
squares. Also called "Ridge Regularization."


Label

The answer or result part of an example in super- 
vised learning. 


Labeled Example 
An example that has a label. 


Labeler 
See "Annotator." 


Labeling 
See "Data Annotation." 


Lambda 
See "Regularization Rate." 


Landmark 
See "Keypoint." 


Language Model 
A model that determines the probability of a given 
sequence of words occurring in a sentence. 


Language Model For Dialogue Applications (LaMDA) 
A large language model based on a transformer for 
generating conversational responses. 


Large Language Model (LLM) 
A language model with a high number of parameters. 


Latency 
A delay in data transmission or processing. 


Layer 

A group of neurons in a neural network that pro- 
cesses the input features or the output of those 
neurons. 

Layer Width 

The number of neurons in a specific neural
network layer.


Lazy Learning
A learning method that defers the processing of 
training data until receiving a query. 


Leaky ReLU 

A kind of activation function based on Rectified 
Linear Unit (ReLU), but it has a small slope for 
negative values. 
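
A minimal sketch of the function described above. The default slope of 0.01 is a commonly used value and is an assumption here, as is the parameter name `negative_slope`.

```python
def leaky_relu(x, negative_slope=0.01):
    """Return x for positive inputs and a small fraction of x otherwise."""
    return x if x > 0 else negative_slope * x


for value in (-2.0, 0.0, 3.0):
    print(value, leaky_relu(value))
```

Unlike plain ReLU, the output for negative inputs is not exactly zero, so the gradient there is small but nonzero.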


Learning Rate 

A hyperparameter used in optimization algorithms 
to define the step size at each iteration. Also called
"Step Size." 


Learning Theory 
See “Computational Learning Theory.” 


Least Square Error (LSE) 

A method that finds the best fit to the data by minimizing
the sum of squared errors between the actual data and
the fitted line.


Least Squares Regression 
A linear regression trained by reducing the L2 loss. 


Leave-One-Out Cross-Validation (LOOCV) 


A special case of K-Fold Cross-Validation where the 
number of folds equals the number of observations. 


Likelihood 


A probability of an occurrence within a specific
category or class. 


Linear Algebra 
A branch of mathematics that allows the manipu- 
lation of linear equations. 


Linear Model 
A way to describe a response variable as a linear 
combination of predictor variables. 


Linear Regression 

A type of statistical analysis that shows the re- 
lationship among multiple variables to create a 
predictive model and illustrate trends in data. 


Locally Weighted Regression (LWR) 


A non-parametric technique that performs regression
around a point of interest using local training data.


Location Data


The data related to the geographical position of a 
device or other asset. 


Log File 

A computer-generated file that contains a record
of all activities within a server, operating system, 
or application. 

Log Loss 

A loss function that is used in binary logistic re- 
gression. 

Log Odds 

The logarithm of odds for an event. 


Logic Programming 
A type of programming approach where the exe- 
cution is based on formal logic statements. 


Logistic Regression 

A statistical model that uses a logistic function to 
convert a linear model's prediction into a value 
between 0 and 1. Also called "Logit Model." 
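
The conversion described above, together with its inverse (the log odds), can be sketched as follows. The function names `sigmoid` and `log_odds` are assumptions for illustration; the two-branch form of `sigmoid` is a common way to avoid overflow for large negative inputs.

```python
import math


def sigmoid(z):
    """Logistic function: map any real number to a value between 0 and 1."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    # For negative z, use an equivalent form that avoids overflow in exp().
    e = math.exp(z)
    return e / (1.0 + e)


def log_odds(p):
    """Logarithm of the odds p / (1 - p), for 0 < p < 1."""
    return math.log(p / (1.0 - p))


linear_prediction = 1.5  # hypothetical output of a linear model
probability = sigmoid(linear_prediction)
print(probability, log_odds(probability))
```

Applying `log_odds` to the output of `sigmoid` recovers the linear model's prediction, which is why the two definitions are usually discussed together.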


Logit 

A vector of non-normalized predictions that a clas- 
sification model makes. 

Logit Model 

See "Logistic Regression." 


Long Short-Term Memory (LSTM) 

A type of recurrent neural network that processes 
sequences of data with acceptable performance 
for both long and short dependencies. 

Loss 

A measure that shows how far a model's predic- 
tion is from the actual label by using a loss func- 
tion. Also called "Cost." 


Loss Curve 
A graph of the model error in training iterations. 


Loss Function 

A function used to define the error between the 
output of an algorithm and the target value. Also 
called "Cost Function." 


Loss Surface 
A graph that represents weights versus loss.


Machine Data
The data that is automatically generated by a machine.


Machine Hearing
See "Machine Listening." 


Machine Intelligence 
See “Artificial Intelligence (AI).”


Machine Learning (ML) 

An AI subfield concerned with learning patterns
from available data to make predictions or decisions 
based on new data without explicit programming. 


Machine Learning Algorithm 

An algorithm that creates a machine learning mod- 
el by using training data. 

Machine Learning Model 

See "Model." 


Machine Listening 

The use of signal processing and machine learning 
for enabling a machine to understand sounds. Also 
called "Machine Hearing." 


Machine Perception 

The ability of a system to receive and explain data 
from the outside environment in a similar manner 
to how humans utilize their senses. 


Machine Touch 
A field of machine perception concerned with
processing tactile information by machine.


Machine Translation 

A branch of computational linguistics that is
concerned with translating text or speech from one
language to another.


Machine Vision 
See "Computer Vision." 


Majority Class
The class with the most common label in a dataset 
that has imbalanced classes. 


Markov Chain 

A stochastic model that describes a sequence of 
possible events in which the probability of each 
event depends only on the previous event. Also 
called "Markov Process." 
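
A minimal simulation of the idea above: the next state is drawn using only the current state. The transition table `weather`, its probabilities, and the function name `simulate_markov_chain` are hypothetical values chosen for illustration.

```python
import random


def simulate_markov_chain(transitions, start, steps, seed=0):
    """Sample a path where each step depends only on the current state."""
    rng = random.Random(seed)
    state = start
    path = [state]
    for _ in range(steps):
        next_states = list(transitions[state].keys())
        weights = list(transitions[state].values())
        state = rng.choices(next_states, weights=weights, k=1)[0]
        path.append(state)
    return path


# Hypothetical transition probabilities; each row sums to 1.
weather = {
    "sunny": {"sunny": 0.8, "rainy": 0.2},
    "rainy": {"sunny": 0.4, "rainy": 0.6},
}
print(simulate_markov_chain(weather, "sunny", 5))
```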


Markov Decision Process (MDP) 

A discrete-time stochastic control process used 
for modeling decision-making in optimization 
problems under uncertainty.


Markov Process
See "Markov Chain." 


Markov Property 

A property of specific environments in which state
transitions are determined by the information implicit
in the current state and the agent's action.


Masked Language Model 

A language model for predicting the probability 
of candidate words to fill in blanks based on the 
surrounding context. 


Master Data 

A set of identifiers and attributes that defines the 
main business entities and provides a context for 
business transactions. 


Master Data Management (MDM) 
The tools and processes that are used to govern
and maintain master data. 


Mathematical Optimization 
See "Optimization." 


Matplotlib 
An open-source Python plotting library used to 
visualize different aspects of machine learning. 


Matrix Factorization 
A mechanism for finding matrices whose dot product
approximates a target matrix.


Maximum Likelihood Estimation (MLE) 
A method for determining the parameter values 
that maximize the probability of observed data. 


Mean Absolute Error (MAE) 

A metric used to evaluate a model's accuracy by 
calculating the average absolute difference be- 
tween the expected and predicted values for all 
training examples. 


Mean Squared Error (MSE) 

The average squared loss per example, calculated 
by dividing the squared loss by the number of 
examples. 
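
The two error metrics defined above (MAE and MSE) can be computed as in this minimal sketch. The function names and the sample values are assumptions for illustration.

```python
def mean_absolute_error(actual, predicted):
    """Average of the absolute differences between actual and predicted values."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def mean_squared_error(actual, predicted):
    """Average of the squared differences between actual and predicted values."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


actual = [3, 5, 2]
predicted = [2, 5, 4]
print(mean_absolute_error(actual, predicted))
print(mean_squared_error(actual, predicted))
```

Because MSE squares each difference, a single large error raises it far more than it raises MAE.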


Membership Inference Attack 

An attack that aims to determine whether a data 
sample was used in the training of a machine 
learning model or not. 


Metadata 
The data that defines or describes other data.


Metaheuristic
A higher-level algorithmic framework that can be
used to solve different optimization problems. 


Metalearning 

A subfield of machine learning used to discover
or improve a learning algorithm by learning from a 
small amount of data or experiences from previous 
tasks. Also called "Learning to learn." 

Metric 

A measure to evaluate the performance of a model 
during training or testing. 

Mini Batch 

A small subset that has been selected randomly
from a batch of examples to be used in a training 
iteration. 

Mini-Batch Stochastic Gradient Descent (SGD) 

A gradient descent algorithm that divides the train- 
ing dataset into small batches to estimate the 
gradient. 
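
The batching step can be sketched as below: the training examples are shuffled and then split into small batches, each of which would be used for one gradient estimate. The function name `mini_batches` and the fixed `seed` are assumptions for illustration.

```python
import random


def mini_batches(examples, batch_size, seed=0):
    """Shuffle the examples and yield them in batches of at most batch_size."""
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)
    for start in range(0, len(shuffled), batch_size):
        yield shuffled[start:start + batch_size]


for batch in mini_batches(range(10), batch_size=4):
    print(batch)
```

The last batch may be smaller than `batch_size` when the dataset size is not a multiple of it.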

Minimax Loss 

A loss function used in generative adversarial
networks (GANs) in which the generator attempts
to minimize this function and the discriminator
attempts to maximize it.

Minority Class 

The class with the less common label in a dataset 
that has imbalanced classes. 


Misclassification Rate 
A measure to determine how often a model's pre- 
dictions are incorrect. 


Miss Rate 
See "False Negative Rate (FNR)." 


MNIST 

A public-domain dataset containing images of
handwritten digits used to train and test various
supervised machine learning algorithms.


Mobile Robot

A robot that is able to move around in its environ- 
ment and is not fixed to a physical location. 


Modality 
A general category of data such as text, images,
audio, video, and numbers.


Model

A representation of what a machine learning al- 
gorithm has learned from the training data. Also 
called "Machine Learning Model." 


Model Capacity 
The ability of a model to learn complex problems. 


Model Checking 

A verification technique that determines whether 
a model satisfies a given specification. 

Model Parallelism 

A technique used to speed up training or inference 
by dividing the model across multiple devices.


Model Testing 
The process of evaluating the performance of a 
fully trained model by using a testing dataset. 


Model Training 

See "Training." 

Momentum 

A gradient descent algorithm in which a learning 
step is determined based on the derivative of the 
immediately preceding steps and the derivative of
the current step. 
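
One common formulation of this idea keeps a running "velocity" that blends previous steps with the current gradient. The sketch below uses that formulation; the function name `momentum_step`, the coefficient `beta`, and the example function f(w) = w^2 are assumptions for illustration.

```python
def momentum_step(weight, velocity, gradient, learning_rate=0.1, beta=0.9):
    """One gradient descent step with momentum.

    The velocity accumulates past steps (scaled by beta) and the current
    gradient (scaled by the learning rate).
    """
    velocity = beta * velocity - learning_rate * gradient
    return weight + velocity, velocity


# Minimize f(w) = w**2, whose derivative is 2 * w.
w, v = 5.0, 0.0
for _ in range(500):
    w, v = momentum_step(w, v, gradient=2 * w)
print(w)
```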


Monte Carlo Tree Search (MCTS) 

A decision-making algorithm that finds optimal deci- 
sions in a specific domain by combining Monte Carlo 
simulations and a tree search approach. 


Multi-Agent System (MAS) 
A system that consists of multiple agents interacting 
in a shared environment to achieve specific goals. 


Multi-Class Classification 

A type of classification that differentiates between 
more than two classes. 

Multi-Class Logistic Regression 

The use of logistic regression for classification 
problems with multiple classes. Also called "Mul- 
tinomial Regression." 

Multi-Head Self-Attention 

A self-attention mechanism that runs several times 
for every position in the input sequence. 


Multi-Swarm Optimization 

A variant of particle swarm optimization (PSO) that 
estimates the solution for complex problems using 
multiple swarms instead of a single swarm.


Multilayer Neural Network
A neural network that has more than one layer of 
artificial neurons or nodes. 


Multimodal Model 
A model that has inputs or outputs with multiple 
modalities. 


Multinomial Classification 
See "Multi-Class Classification." 


Multinomial Regression 
See "Multi-Class Logistic Regression."


N-Gram
A sequence of N words in a text. 
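
A minimal sketch that extracts every N-gram from a text, assuming that words are separated by whitespace. The function name `ngrams` is an assumption for illustration.

```python
def ngrams(text, n):
    """Return all sequences of n consecutive words in the text."""
    if n < 1:
        raise ValueError("n must be at least 1")
    words = text.split()
    return [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]


print(ngrams("the cat sat down", 2))
```

A text with fewer than N words yields an empty list.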


Naive Bayes Classifier 
A probabilistic classifier based on Bayes' Theorem 
for analyzing and categorizing data. 


Naive Semantics 

An approach to represent basic knowledge about 
a particular domain. 

Named-Entity Recognition (NER) 

The process of identifying the names of entities 
and their categories in text or speech. 

NaN Trap 

A situation when a number in a model becomes
"Not a Number" during the training process, causing
other numbers to become "Not a Number."

Narrow Artificial Intelligence (NAI)

A type of AI system that can only perform specific 
tasks. Also called "Artificial Narrow Intelligence 
(ANI)" or "Weak Artificial Intelligence." 

Natural Language 

A human language that can be expressed in text,
speech, or sign. 

Natural Language Generation (NLG) 

A process of converting structured data into nat- 
ural language. 

Natural Language Processing (NLP) 

A branch of artificial intelligence concerned with
understanding or generating human language in
either text or speech form.

Natural Language Understanding (NLU)

A subfield of natural language processing (NLP) 
concerned with building machines capable of 
comprehending the natural human language. 


Negative Class 
A class in a binary classification that represents 
the other possibility of a test. 


Negative Predictive Value (NPV) 
The proportion of predicted negative results that are 
true negative results. 


Neural Machine Translation (NMT) 

An approach in machine translation that uses a 
large artificial neural network to predict the prob- 
ability of a sequence of words, often in complete 
sentences. 


Neural Network (NN) 
See “Artificial Neural Network (ANN).” 


Neural Turing Machine (NTM) 


A neural network architecture that can deduce
algorithms from examples.

Neuromorphic Chip 

A type of computer chip that acts as a neural network
to improve the performance of AI applications.


Neuromorphic Computing 

A computing approach that emulates the human
brain by using special algorithms and hardware to
improve power efficiency and computation speed.
Also called "Neuromorphic Engineering."


Neuromorphic Engineering

See "Neuromorphic Computing." 


Neuron 

A neural network node that processes multiple
input values and produces a single output value.
Also called "Artificial Neuron."


Node 
See "Neuron." 


Noise (or Noisy Data) 
Meaningless data that affects data analysis.


Non-Parametric Model 

A machine learning model that does not make 
specific assumptions about the form of the map- 
ping function and does not assume a finite set of 
parameters.


Non-Relational Database
A database that does not use a relational model. 


Non-Relational Model 
A logical data model that stores and manipulates 
data without following a relational model. 


Non-Response Bias 
A type of selection bias in which there are par- 
ticipation gaps in the collected data. Also called 
"Participation Bias." 


Nonstationarity 
A property of data that has a changing distribution 
over one or more dimensions. 


Normal Distribution 

A continuous probability distribution defined by
two parameters: the mean and the standard devi- 
ation. Also called "Gaussian Distribution." 


NoSQL 

A database design approach that allows for stor- 
ing and querying data differently than relational 
databases. Also called "Not Only SQL", "Non- 
SQL" or "Non-Relational." 


Not Suitable For Work (NSFW) 
A shorthand used to describe specific content as
unsuitable for viewing in workplaces or near children. 


Novelty Detection 
The process of identifying new data that differs 
from the rest of the training data. 


Null Error Rate 
A measure of how incorrect the classifier would be 
if it just predicted the majority class. 
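
A minimal sketch of this measure: count how often the most common label occurs, and report the fraction of examples that a majority-only classifier would get wrong. The function name `null_error_rate` and the sample labels are assumptions for illustration.

```python
from collections import Counter


def null_error_rate(labels):
    """Error rate of a classifier that always predicts the majority class."""
    majority_count = Counter(labels).most_common(1)[0][1]
    return 1 - majority_count / len(labels)


labels = ["no"] * 7 + ["yes"] * 3
print(null_error_rate(labels))
```

This value is a useful baseline: a trained classifier should have a lower error rate than this one.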


Numerical Data 
The features that are represented as integers or 
real numbers. 


NumPy 
An open-source Python library that offers powerful 
array operations for scientific and mathematical tasks.


Object
An entity (physical or conceptual) that may have 
one or more attributes. 


Object Detection 
The process of detecting objects of a particular 
class in an image or a video.


Object Recognition
A computer vision technique that identifies objects 
in images or videos. 


Object Tracking 
The process of following one or more objects in 
a given scene. 


Objective 
A metric that the algorithm is attempting to optimize. 


Objective Function 
A function that a model is attempting to improve. 


Observation 
An item of data or information related to items of data. 


Occam's Razor 
A principle for problem-solving that prefers simpler 
models over complex models. 


Offline Inference 
A process of generating predictions on a batch of 
observations. Also called "Batch Inference." 


One-Hot Encoding 
A method of converting data into vectors in which all
elements are set to 0 except one, which is set to 1.
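
A minimal sketch of the encoding, assuming the list of possible categories is known in advance. The function name `one_hot` and the color categories are assumptions for illustration.

```python
def one_hot(value, categories):
    """Return a vector of zeros with a single 1 at the position of value."""
    vector = [0] * len(categories)
    vector[categories.index(value)] = 1
    return vector


colors = ["red", "green", "blue"]
print(one_hot("green", colors))
```

A value that is not in `categories` raises a `ValueError` in this sketch; real pipelines typically decide explicitly how to handle unseen categories.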


One-Shot Classification 
See "One-Shot Learning." 


One-Shot Learning 

An approach to learn effective classifiers from a 
single training example. Also called "One-Shot 
Classification." 


One-vs-All 

A classification method to deal with multi-class 
problems by using multiple binary classifiers, one 
binary classifier for each possible outcome. 


Online Inference 
A process of generating predictions in real-time 
on-demand. 


Online Machine Learning 

A subfield of machine learning in which a model learns 
incrementally from new data in a sequential way. Also 
called "Continual Learning (or Continuous Learning)." 


Ontology 

A knowledge representation that includes a set of 
classes, attributes, and relationships to model a 
knowledge domain.


Ontology Learning

The data-driven techniques for building ontolo- 
gy using automatic or semi-automatic methods 
based on text mining or machine learning. 


Open-Source Software (OSS)
The software whose source code is freely available 
to anyone to access, modify, use, and distribute. 


Operational Database (ODB)
A database that is used to store and manipulate 
data in real-time. 


Optical Character Recognition (OCR) 
A program that identifies text in images and con- 
verts it into machine-readable text. 


Optimization 
The process of selecting the best element from all 
possible alternatives. Also called "Mathematical 
Optimization." 


Optimizer 

An algorithm or method used to reduce the loss 
function by adjusting the parameters of a neural 
network. 


Out-Group Homogeneity Bias 

The tendency to see members out of a group as 
more similar than members in the group when 
comparing their characteristics. 


Outlier 
An observed value in a dataset that appears to 
be inconsistent with or distant from most other 
values. 


Outlier Detection 

The process of identifying values in a dataset that 
appear inconsistent with or distant from most oth- 
er values. Also called "Unsupervised Anomaly 
Detection." 


Output Data (or Output) 
The data that results from an information
processing system or any of its parts.


Output Layer 
The final layer in a neural network that contains 
the result. 


Overfitting 

A machine learning problem that occurs when
a model matches a training dataset so closely
that it fails to generalize to new data.


Oversampling

A technique used to balance a class-imbalanced 
dataset by duplicating examples from the minority 
class. Also called "Upsampling." 
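
The duplication step can be sketched as follows: examples from each smaller class are randomly duplicated until every class is as large as the biggest one. The function name `oversample` and the fixed `seed` are assumptions for illustration.

```python
import random
from collections import Counter


def oversample(examples, labels, seed=0):
    """Duplicate random examples of smaller classes until all classes match."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())

    out_examples, out_labels = list(examples), list(labels)
    for label, count in counts.items():
        pool = [x for x, y in zip(examples, labels) if y == label]
        for _ in range(target - count):
            out_examples.append(rng.choice(pool))
            out_labels.append(label)
    return out_examples, out_labels


features = list(range(10))
labels = ["neg"] * 8 + ["pos"] * 2
balanced_features, balanced_labels = oversample(features, labels)
print(Counter(balanced_labels))
```

Oversampling is usually applied to the training split only, so that duplicated examples do not leak into evaluation data.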


Padding 

The number of bits or characters added to a data 
structure, such as an image or a text, to enable 
further processing. 


Pandas 
An open-source data analysis and manipulation 
tool built on the Python programming language.


Parallel Processing 

An approach to handle heavy computation by 
simultaneously breaking up and running program 
tasks on multiple processors. 


Parallel Query 

A method that divides the query statement into 
multiple processes executed in parallel to increase 
the speed of the query. 


Parameter 
A variable of a model that can be trained iteratively 
by a machine learning system. 


Parameter Server (PS) 
A framework for distributed deep learning that
keeps track of a model's parameters. 


Parameter Update 
The process of modifying a model's parameters 
during training. 


Parametric Model 

A machine learning model that makes specific
assumptions about the form of the mapping function
and assumes a finite set of parameters.


Part-of-Speech Tagging 
The process of specifying a category to a word 
according to its grammatical properties. 


Partial Derivative 
The derivative of a several-variable function with 
respect to change in only one variable. 
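
A partial derivative can be approximated numerically by changing one variable slightly while holding the others fixed. The sketch below uses a central difference; the function name `partial_derivative`, the step size `h`, and the example function f(x, y) = x^2 * y are assumptions for illustration.

```python
def partial_derivative(f, point, index, h=1e-6):
    """Approximate the partial derivative of f at point along one variable."""
    forward = list(point)
    backward = list(point)
    forward[index] += h   # nudge only the chosen variable up
    backward[index] -= h  # and down; all other variables stay fixed
    return (f(*forward) - f(*backward)) / (2 * h)


def f(x, y):
    return x ** 2 * y


# Analytically, df/dx = 2xy and df/dy = x**2.
print(partial_derivative(f, (3.0, 2.0), index=0))
print(partial_derivative(f, (3.0, 2.0), index=1))
```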


Partially Observable Markov Decision Process (POMDP) 


A generalization of the Markov decision process 
(MDP) that allows agent planning under uncer- 
tainty and optimizes decision making in partially 
observable environments.


Partially Structured Data
See “Semi-Structured Data.” 


Participation Bias 
See "Non-Response Bias." 


Particle Swarm Optimization (PSO) 
An optimization algorithm inspired by birds flocking 
to find an optimal solution in the solution space. 


Partitioning Strategy 
An algorithm that divides the variables across 
parameter servers. 


Pattern Recognition 
An area of machine learning that focuses on
identifying patterns or regularities in data.


Perceptron 

An algorithm that accepts input values, executes a 
function on the weighted sum of the input values, 
and provides a single output value. 
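
A minimal sketch of the computation described above, using a step function as the function applied to the weighted sum. The step activation, the bias term, and the weights chosen to reproduce a logical AND are assumptions for illustration.

```python
def perceptron(inputs, weights, bias):
    """Apply a step function to the weighted sum of the inputs plus a bias."""
    weighted_sum = sum(x * w for x, w in zip(inputs, weights)) + bias
    return 1 if weighted_sum > 0 else 0


# Hypothetical weights that make the perceptron behave like a logical AND.
and_weights, and_bias = [1.0, 1.0], -1.5
for a in (0, 1):
    for b in (0, 1):
        print(a, b, perceptron([a, b], and_weights, and_bias))
```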


Performance 
A quantitative or qualitative measure for the effi- 
ciency or the correctness of a model. 


Perplexity
A measure of how well a model accomplishes its task,
commonly used to evaluate how well a language model
predicts text.


Personal Data 
The data about an identified or identifiable person. 


Personal Service Robot 
A robot that a common person uses to perform 
non-commercial tasks. 


Personalization 
The process of analyzing users’ trends and pat- 
terns to customize products for individual users. 


Pipeline 
A way of organizing the workflow of extracting insights
from data or producing a machine learning model. 


Pipelining 

A type of model parallelism where the processing 
is split into a sequence of stages run over different 
devices. 


Planning 

A branch of AI concerned with selecting the appro- 
priate set of actions to achieve a specific goal. Also 
called "Automated Planning and Scheduling."


Policy

The probabilistic mapping of an agent from states 
to actions in reinforcement learning. 

Pooling Layer 

A layer that reduces a matrix created by a
convolutional layer to a smaller one.

Positive Class 

A class in a binary classification that represents 
what the test is seeking to find. 

Positive Predictive Value (PPV) 

The proportion of predicted positive results that 
are true positive results. 

Post-Processing 

The processing of the model's output after running 
a model. 

Posterior Probability 

A conditional probability distribution that represents
parameters after observing the data.
Pre-Trained Model 

A model or model component that has been 
trained using another dataset. 

Precision 

A metric for determining the proportion of identified
positives that were actually correct.
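
As a small illustration, precision (the same quantity as the positive predictive value) can be computed from counts of true and false positives. The function name and the zero-division convention below are assumptions made for this sketch.

```python
def precision(true_positives, false_positives):
    """Proportion of predicted positives that are actually positive."""
    predicted_positives = true_positives + false_positives
    if predicted_positives == 0:
        # Assumed convention for this sketch: no positive predictions gives 0.0.
        return 0.0
    return true_positives / predicted_positives


# Example: 8 correct positive predictions and 2 incorrect ones.
example_precision = precision(8, 2)
```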
Precision-Recall Area Under Curve (PR AUC) 
An area under the curve of precision-recall points 
that represent different values of the classification 
thresholds. 


Precision-Recall Curve 

A graph that represents the precision versus the 
recall at different classification thresholds. 
Predictability 

A property of an AI system that enables stakeholders to
make assumptions about the system output.

Prediction 

The output of a machine learning model. 
Prediction Bias 

A value that measures the difference between the 
average of predictions and the average of actual 
labels in a dataset. 
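
The definition above can be sketched directly as the difference of two averages. The function name and the sample values are illustrative assumptions.

```python
def prediction_bias(predictions, labels):
    """Average of the predictions minus the average of the actual labels."""
    mean_prediction = sum(predictions) / len(predictions)
    mean_label = sum(labels) / len(labels)
    return mean_prediction - mean_label


# A model that predicts 0.5 everywhere on data where 25% of labels are positive.
bias = prediction_bias([0.5, 0.5, 0.5, 0.5], [1, 0, 0, 0])
```

A value near zero suggests that the model's average prediction matches the average label; a large positive or negative value suggests systematic over- or under-prediction.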
Predictive Analytics

The practice of using historical data combined 
with mathematical models to predict future out- 
comes or unknown events. 

Predictive Model 


A model that allows predicting future outcomes or 
unknown events based on past data. 


Predictive Parity 

A fairness metric that examines whether the precision
rates of a classifier are equivalent for all the
subgroups. Also called "Predictive Rate Parity."


Predictive Rate Parity 
See "Predictive Parity." 


Predictive System 

A machine learning application capable of finding 
relationships between variables in historical data- 
sets to predict future outcomes. 


Preprocessing 
A process that transforms raw data into an under- 
standable format before it is used to train a model. 


Prevalence 
The proportion of a sample in which the “yes” condition
appears.


Principal Component Analysis (PCA) 

A technique used to remove redundant information
and reduce a dataset's features to their principal
components.


Prior Belief 
The belief about data before starting the training on it. 


Prior Probability 

A probability distribution that represents the uncertainty
of data before observing it.

Privacy 

A right to control access to and use of physical 
items and personal information. 


Probabilistic Model 

A technique that incorporates random variables 
into a model to provide a probability distribution 
as a solution. 

Probabilistic Regression Model 

A regression model that uses the feature weights 
and their uncertainty to generate a prediction and 
its uncertainty. 
Probability
A number that represents the chance that a specific
event will happen.


Professional Service Robot 
A service robot used for performing a commercial 
task and operated by a trained operator. 


Prolog 

A logic programming language used for rapid 
prototyping and symbol manipulation tasks like 
analyzing natural language. 


Proxy Attribute 


An attribute that is used as a substitute for a sensitive
attribute.

Proxy Label 

The data for approximating labels that are not 
directly available in a dataset. 


Pruning 

The process of cleaning up code or a model to
remove unwanted solutions and make the code
easier to maintain.

Python 

A high-level, general-purpose programming
language that has gained increasing popularity in
data science and machine learning.
Q-Function

A function in reinforcement learning that predicts
the return when taking an action in a state and following
a given policy. Also called "State-Action
Value Function."


Q-Learning 

A reinforcement learning algorithm that allows 
an agent to learn and act optimally in controlled 
domains. 


Qualification Problem 
The problem of specifying the preconditions for
an action in a given situation.


Quantile 
A cut point that divides a probability distribution 
into continuous intervals with equal probabilities. 


Quantile Bucketing 

The process of distributing values of a feature into 
buckets so that each bucket has approximately 
the same number of instances. 
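
One possible way to sketch quantile bucketing is to rank the values and split the ranks evenly. The function name is hypothetical, and this simple version assigns buckets by rank, so equal values may land in different buckets.

```python
def quantile_buckets(values, num_buckets):
    """Assign each value a bucket index so that buckets hold roughly equal counts."""
    # Indices of the values, ordered from smallest to largest value.
    order = sorted(range(len(values)), key=lambda i: values[i])
    buckets = [0] * len(values)
    for rank, index in enumerate(order):
        # Map ranks 0..n-1 evenly onto bucket indices 0..num_buckets-1.
        buckets[index] = rank * num_buckets // len(values)
    return buckets


feature = [10, 200, 3, 45, 7, 1000, 60, 2]
assigned = quantile_buckets(feature, 4)
```

With eight values and four buckets, each bucket receives exactly two values, regardless of how skewed the raw values are.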
Quantization
An algorithm that applies quantile bucketing on 
a specific feature in a dataset. 


Quantum Computing 

An emerging field of computer science that uti- 
lizes quantum mechanics phenomena for com- 
putational purposes. 


Query 
A statement asking for information to be retrieved. 


Query Analysis 
A process used in databases to analyze the query 
and optimize its performance. 


Query Language 
A language used for accessing the stored infor- 
mation in a database. 


Question Answering (QA) 
The process of determining the most appropriate 
answer to a question in natural language. 



R 

R Language
A high-level programming language used broadly
for statistical analysis and machine learning.


Radial Basis Function (RBF) 

A function whose value depends on the distance
from a center.
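
The Gaussian function is one widely used example of a radial basis function. The sketch below assumes that form; the name `gaussian_rbf` and the width parameter `gamma` are illustrative choices.

```python
import math


def gaussian_rbf(point, center, gamma=1.0):
    """Gaussian RBF: depends only on the distance between point and center."""
    squared_distance = sum((p - c) ** 2 for p, c in zip(point, center))
    return math.exp(-gamma * squared_distance)


# Two points at the same distance from the center give the same value.
left = gaussian_rbf([0.0, 0.0], [3.0, 4.0], gamma=0.1)
right = gaussian_rbf([6.0, 8.0], [3.0, 4.0], gamma=0.1)
```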

Radial Basis Function Network (RBFN) 

An artificial neural network that uses radial basis
functions (RBFs) as activation functions for non-linear
classification tasks.

Random Forest 

An ensemble learning method that creates many 
decision trees and combines their outputs to get 
the best result. 

Random Policy 

A policy in reinforcement learning that selects an 
action at random. 

Rank 

A class's ordinal position in a machine learning model 
that categorizes classes from highest to lowest. 


Rater 
See "Annotator." 
Re-Programmable Robot

A robot designed so that the programmed move- 
ments or auxiliary functions can be modified with- 
out physical changes. 

Re-Ranking 

A final step in a recommendation system that con- 
siders additional criteria or constraints to regrade 
scored items. 


Real-Time Data 

The data that can be immediately accessed after 
it is generated. 

Reasoning and Decision Making 

A part of the knowledge representation and rea- 
soning approach that focuses on designing, an- 
alyzing, and implementing inference algorithms. 


Recall 

A metric for determining the proportion of actual
positives that were identified correctly. Also called
"True Positive Rate (TPR)." 

Receiver Operating Characteristic (ROC) Curve 
A graphical representation of a classifier's perfor- 
mance at all possible thresholds. 


Recommendation Algorithm 
See "Recommendation System." 


Recommendation Engine 
See "Recommendation System." 


Recommendation System 

A system that suggests a set of items to users based
on their choices and behaviors. Also called "Recommendation
Algorithm" or "Recommendation Engine."


Rectified Linear Unit (ReLU) 

An activation function that returns 0 if the input is 
negative or zero and returns the same input value 
if the input is positive. 
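
The rule above translates almost word for word into code. This is a scalar sketch; libraries typically apply the same rule element-wise to whole arrays.

```python
def relu(x):
    """Return 0 for negative or zero input, and the input itself otherwise."""
    return x if x > 0 else 0.0


activations = [relu(v) for v in (-2.0, 0.0, 3.5)]
```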

Recurrent Neural Network (RNN) 


A type of neural network used to understand se- 
quential information and identify patterns by run- 
ning multiple times. 


Reference Data 
The data that is used to categorize other data. 


Regression 
See "Regression Analysis." 
Regression Analysis

A statistical technique for determining the relation- 
ship between dependent and independent variables. 
Also called "Regression" or "Regression Model." 


Regression Model 
See "Regression Analysis." 


Regularization 
A process used to prevent overfitting by adding 
additional information. 


Regularization Rate 
A value that determines the relative importance of 
the regularization function. Also called "Lambda." 


Reinforcement Learning (RL) 

A type of machine learning that learns the optimal 
policy through rewards and punishments with the 
objective of optimizing the total reward. 


Relational Database 
A database in which the data is structured according
to a relational model that consists of a
collection of tables.


Relational Machine Learning (RML) 
See "Statistical Relational Learning (SRL)." 


Relational Model 

A data model that has a structure based on a set 
of tables. 

Relationship Extraction 

The process of identifying relations among entities 
mentioned in a text. 

Reliability 

The property of intended consistency in behavior 
and results. 

Replay Buffer 

A memory in which the agent stores state transitions
so that past experience can be replayed.


Reporting Bias 

A type of bias that occurs when the frequency in 
a dataset doesn't reflect the frequency in the real 
world. 


Reservoir Computing (RC) 

A computational framework based on recurrent
neural networks (RNNs) that is best suited to dynamical
systems and requires only a small training dataset and
minimal computing resources.
Residual Sum of Squares (RSS)

A statistical technique to measure the variance 
between the data and the values predicted by the 
estimation model. Also called "Sum of Squared 
Residuals (SSR)." 
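
As a short worked sketch, the residual sum of squares adds up the squared differences between each observed value and the corresponding predicted value. The function name and data are illustrative.

```python
def residual_sum_of_squares(actual, predicted):
    """Sum of squared residuals between observed and predicted values."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted))


# Residuals are 1, 0, and -2, so the squares are 1, 0, and 4.
rss = residual_sum_of_squares([3, 5, 7], [2, 5, 9])
```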


Resilience 


A system's ability to recover operations quickly
after an incident.


Resource Description Framework (RDF) 
A standard model for exchanging data on the Web. 


Restricted Boltzmann Machine (RBM) 

A stochastic artificial neural network that learns a 
probability distribution over the inputs by applying 
graphical models. 


Rete Algorithm 
A pattern-matching algorithm that is used in rule 
engines to compare patterns to objects. 


Retraining 
The process of updating a trained model by train- 
ing it with new data. 


Return 
A total of rewards in reinforcement learning that 
an agent expects when following a specific policy. 


Reward 
A numerical result in reinforcement learning defined 
by the environment when taking action in a state. 


Reward Function 

An incentive mechanism in reinforcement learning 
that defines how an agent ought to take actions in 
an environment. 


Ridge Regularization 
See "L2 Regularization." 


Robot 

A programmable device capable of performing
intended tasks automatically by sensing its environment.


Robot Control System 

A group of logic control and power functions that 
controls the robot's mechanical structure and 
communication with the environment. 


Robot Journalism 
The use of software to write content without human 
intervention. Also called "Automated Journalism." 



Robot System 
A system that includes robot(s) and any other de- 
vices that support it to perform its tasks. 


Robotic Device 

An actuated mechanism that fulfills the characteristics
of an industrial or service robot but lacks
either the programmable axes or the degree of
autonomy.


Robotic Process Automation (RPA) 
A type of process automation in which software
or a robot mimics how humans accomplish a task.


Robotics 
A field that focuses on designing, developing, and 
implementing robots. 


Robustness 
The system's ability to preserve its performance 
under any circumstances. 


Root Mean Square Deviation (RMSD) 
See "Root Mean Squared Error (RMSE)." 


Root Mean Squared Error (RMSE) 

A measure of the difference between predicted 
values and actual values. Also called "Root Mean 
Square Deviation (RMSD)." 
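
A minimal sketch of the usual RMSE computation: square each error, average the squares, and take the square root. The function name and sample data are illustrative.

```python
import math


def rmse(actual, predicted):
    """Square root of the mean of the squared prediction errors."""
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# One prediction is off by 4; the mean squared error is 16 / 4 = 4.
error = rmse([1, 2, 3, 4], [1, 2, 3, 8])
```

Because the errors are squared before averaging, a single large error influences RMSE more than several small ones.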


Rotational Invariance 
The ability of an algorithm to properly classify im- 
ages regardless of the change in their orientation. 


Rule-Based Chatbot 
See "Flow-Based Chatbot." 


Rule-Based System 
An expert system that consists of "if-then" rules built 
by learning from real data or using expert knowledge. 
A data element that a machine learning algorithm
can process. 


Sample Size 
The number of individual observations that rep- 
resents a population in an experiment. 


Sampling Bias 
A type of selection bias in which the data is not 
randomly collected. 
Scalability
The ability of a system to deal with an increased 
amount of work. 


Scalar 
A single number that can be an example of a tensor
with a rank of 0.


Scatter 
A method for distributing the processing across 
many nodes in a cluster. 


Scatter-Gather 

A form of processing for large datasets in which the 
required computation is divided up and distributed 
across different nodes of a cluster, and the overall 
result is then combined from each node. 


Scikit-Learn 
A popular open-source Python library for machine
learning.


Scoring 

A recommendation system part that assigns a 
ranking to each item created by the candidate 
generation process. 

Search Query 

An actual word or string of words that a user enters 
into a search engine to find content. 


Security 

A degree to which a system protects data and 
information by allowing appropriate data access 
based on authorization levels. 


Selection Bias 
A bias in the selection of data that is used in train- 
ing machine learning models. 


Selective Filtering 
The process of ignoring noisy information in order 
to concentrate on important information. 


Self-Attention (or Self-Attention Layer) 

A neural network layer that outputs a new sequence of
embeddings by applying an attention mechanism to the
information from the elements of the input sequence.


Self-Driving Car 
See "Autonomous Vehicle."


Self-Labeling 

A type of semi-supervised learning method that
trains a model on labeled data and then uses this
model to assign pseudo labels to unlabeled data.
Self-Supervised Learning

A type of machine learning that uses unlabeled 
data to learn features by adopting self-defined 
pseudo-labels for the supervision. 


Self-Training 
A self-supervised learning method used in classification
when there is more unlabeled data than
labeled data.


Semantic Computing 

The processing to understand the meanings of 
information and express them in a machine-read- 
able format. 


Semantic Network 
See "Knowledge Graph." 


Semantic Reasoner 
A piece of software used to draw logical conse- 
quences from a set of confirmed facts and axioms. 


Semantic Web 
An extension of the World Wide Web (WWW) to en- 
able sharing and reusing data across applications 
by defining data stores, vocabularies, and rules 
to handle data. 


Semi-Structured Data 

The data that has some organization and is characterized
by its complexity and changing schema.
Also called "Partially Structured Data." 


Semi-Supervised Anomaly Detection 
See "Novelty Detection." 


Semi-Supervised Learning 
A type of machine learning that leverages both 
labeled and unlabeled data for training. 


Sensitive Attribute 
A human attribute that warrants special consideration
for some reason.


Sensitive Data 

The data that may have harmful effects if revealed 
or misused. 

Sensitivity 

The rate at which a model correctly predicts the positive
label when the example is actually positive.


Sentiment Analysis 

A process of using statistical or machine learning 
algorithms to determine feelings and opinions in 
a text or an image. 
Sequence Model

A model that has inputs with a sequential depen- 
dence. 

Sequence-To-Sequence (Seq2Seq) Task

A task of converting a sequence of words to an- 
other sequence of words. 


Service Robot 

A robot that can perform useful tasks for humans
or equipment, excluding industrial automation
applications.

Siamese Network 

A neural network used for image classification by 
training two neural networks to learn simultane- 
ously the similarity between images. 

Sigmoid Function 

A function that maps the output of logistic or 
multinomial regression to probabilities with a 
value between 0 and 1. 
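
A minimal sketch of the sigmoid (logistic) function, 1 / (1 + e^(-z)). The two-branch form below is an implementation choice for this example so that very negative inputs do not overflow.

```python
import math


def sigmoid(z):
    """Map any real number to a value between 0 and 1."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    # Algebraically equivalent form that avoids overflow for very negative z.
    exp_z = math.exp(z)
    return exp_z / (1.0 + exp_z)


midpoint = sigmoid(0.0)
```

An input of 0 maps to 0.5; large positive inputs approach 1 and large negative inputs approach 0.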

Signal Processing 

An engineering field that focuses on analyzing and
manipulating signals such as audio, images, and
videos.

Similarity Learning 

A type of supervised machine learning that trains 
a metric to find the similarity between two objects. 
Similarity Measure 

A metric used in clustering algorithms to evaluate 
how similar two examples are. 

Singularity 

See "Intelligence Explosion." 

Size Invariance 

The ability of an algorithm to properly classify im- 
ages regardless of the change in their size. 


Social Media Data 
The data that the users publicly share on social 
media. 


Soft Computing 

A set of computational techniques that deals with 
approximate models and provides effective solu- 
tions to complex problems. 

Softmax 

A function that gives a probability for each class 
in a multi-class classification model. 
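
As an illustration, softmax exponentiates each class score and divides by the sum of the exponentials, so the results are positive and add up to 1. Subtracting the maximum score first is a common numerical-stability step and does not change the result; the scores are made-up values.

```python
import math


def softmax(scores):
    """Convert a list of raw class scores into probabilities that sum to 1."""
    max_score = max(scores)
    # Shifting by the maximum keeps math.exp from overflowing.
    exps = [math.exp(s - max_score) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


probabilities = softmax([2.0, 1.0, 0.1])
```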
Software
A group of instructions that direct a computer on 
what to do. 


Software Development Kit (SDK) 
A set of software development tools that enable the 
creation of applications on a particular platform. 


Solomonoff Probability 
See "Algorithmic Probability." 


SPARQL Protocol and RDF Query Language (SPARQL) 
A query language for accessing and manipulating 
resource description framework (RDF) data. 


Sparse Feature 
A feature vector that has mostly zero or empty values. 


Sparse Representation 
A representation that only contains non-zero el- 
ements. 


Sparse Vector 
A vector that has mostly zero values. 


Sparsity 
The number of elements set to zero in a vector or 
matrix divided by the total number of entries. 
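
The ratio above can be sketched for a matrix stored as a list of rows. The function name and the sample matrix are illustrative.

```python
def sparsity(matrix):
    """Number of zero entries divided by the total number of entries."""
    total = sum(len(row) for row in matrix)
    zeros = sum(1 for row in matrix for value in row if value == 0)
    return zeros / total


# Three of the four entries are zero.
ratio = sparsity([[0, 0], [0, 1]])
```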


Spatial Analysis 

The study of location-based data to discover pat- 
terns and insights. 

Spatial-Temporal Reasoning 

An area of artificial intelligence that is concerned
with representing and reasoning about space
and time as a human does.

Specificity 

The rate at which a model correctly predicts the negative
label when the example is actually negative. Also called
"True Negative Rate (TNR)."

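Sensitivity and specificity can both be computed from the four counts of a binary confusion matrix. The sketch below uses hypothetical function names and made-up counts.

```python
def sensitivity(true_positives, false_negatives):
    """Proportion of actual positives that the model predicted as positive."""
    return true_positives / (true_positives + false_negatives)


def specificity(true_negatives, false_positives):
    """Proportion of actual negatives that the model predicted as negative."""
    return true_negatives / (true_negatives + false_positives)


# 100 actual negatives, 90 of them predicted correctly.
tnr = specificity(90, 10)
# 40 actual positives, 30 of them predicted correctly.
tpr = sensitivity(30, 10)
```
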

Speech Recognition 

The process of converting speech into text using 
algorithms. Also called "Automatic Speech Rec- 
ognition (ASR)" or "Speech to Text (STT)." 


Speech Synthesis 


The process of generating artificial human speech. 
Also called "Text to Speech (TTS)." 


Speech to Text (STT) 
See "Speech Recognition." 
Spiking Neural Network (SNN)

A type of neural network that simulates the en- 
ergy-efficient signal system of the brain where 
neurons communicate asynchronously through 
binary values. 

Squared Hinge Loss 

The square of the Hinge Loss output that deals 
with outliers more strictly than the Hinge Loss. 


Squared Loss 

A loss function used in linear regression to cal- 
culate the squares of the difference between the 
predicted and actual values. Also called "L2 Loss." 


Staged Training 

A strategy for training a model stage-by-stage 
to improve the model quality or accelerate the 
training process. 

Standard Classification 

See “Classification.” 


State 

The parameter values in reinforcement learning 
that describe the environment's current configu- 
ration, which the agent uses to select an action. 
State-Action Value Function 

See "Q-Function." 


Static Model 
A model that is trained offline once and then used 
for a while. 


Stationarity 

A property of data that has a constant distribution 
over one or more dimensions. 

Statistical Classification 

A supervised learning approach that trains a model 
to categorize new unlabeled data based on rele- 
vant labeled data. 

Statistical Learning 

The use of statistical tools to analyze and under- 
stand data. 

Statistical Relational Learning (SRL) 

A branch of machine learning that combines statisti- 
cal learning to treat uncertainty in data and relational 
learning to handle complex relational structures. Also 
called "Relational Machine Learning (RML)."
Step Evaluation

The forward and backward evaluation of a batch.

Step Size 

See "Learning Rate." 


Stochastic Gradient Descent (SGD) 

A gradient descent algorithm that has a batch 
size of one. 

Stochastic Optimization (SO)

A collection of methods to minimize or maximize 
an objective function with random variables. 


Stochastic Semantic Analysis 

An approach used as a semantic component in 
natural language understanding. 

Streaming Data 

The data that arrives from a continuously operating
source.

Strong Artificial Intelligence 

See “Artificial General Intelligence (AGI).” 


Structural Risk Minimization (SRM) 

An inductive principle used to address overfitting 
by balancing between training data fitness and 
model complexity. 

Structured Data 

The data that is organized in a predefined manner 
according to a set of rules. 


Structured Query Language (SQL) 
A standardized query language used in relational 
databases to define and manipulate data. 


Subsampling 
A technique to reduce the data size by choosing 
a subset of the original data. 


Subsymbolic Artificial Intelligence 

A branch of artificial intelligence concerned with
learning through the use of implicit information and
numeric representation.

Sum of Squared Residuals (SSR) 

See "Residual Sum of Squares (RSS)." 


Superintelligence 
A hypothetical AI that can far surpass human ability.
Also called "Artificial Superintelligence (ASI)."
Supervised Learning
A type of machine learning that uses a labeled dataset
to train a model for prediction or classification.


Support Vector Machine (SVM) 

A supervised machine learning model used in data 
classification and regression analysis to find deci- 
sion boundaries with maximal margins. 

Support Vectors 

The data points that are closest to the hyperplane. 


Swarm Intelligence (SI)

A discipline that focuses on the collective behavior 
of elements that coordinate using decentralized 
and self-organized systems. 

Swarm Robotics 

The study of how to build robots that coordinate their 
behaviors from local interactions without relying on 
an external infrastructure or centralized control. 


Symbolic Artificial Intelligence 

A branch of artificial intelligence concerned with
representing human knowledge in explicit facts
and rules.

Synthetic Feature 

A feature generated from one or more input fea- 
tures. 

System 

A group of elements that interact with each other 
to achieve specific goals. 
Tabular Q-Learning

A reinforcement learning method that stores the
Q-function values in a table, with one entry for each
combination of state and action.
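
A minimal sketch of such a table, assuming the standard Q-learning update rule. The state names, action names, learning rate `alpha`, and discount factor `gamma` are hypothetical values chosen for illustration.

```python
from collections import defaultdict


def q_update(q_table, state, action, reward, next_state, actions, alpha=0.5, gamma=0.9):
    """Apply one Q-learning update to the table entry for (state, action)."""
    # Best value currently estimated for the next state.
    best_next = max(q_table[(next_state, a)] for a in actions)
    target = reward + gamma * best_next
    q_table[(state, action)] += alpha * (target - q_table[(state, action)])


actions = ["left", "right"]
q_table = defaultdict(float)  # every (state, action) entry starts at 0.0
q_update(q_table, "s0", "right", 1.0, "s1", actions)
q_update(q_table, "s1", "left", 0.0, "s0", actions)
```

The table is keyed by (state, action) pairs, which is practical only when the numbers of states and actions are small enough to enumerate.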


Target Network 

A neural network with a stable approximation of 
a main neural network that applies a Q-function 
or a policy. 

Targeted Advertising (or Targeted Ads) 

See "Ad Targeting." 


Taxonomy 

A formal structure of all classes or object types
within a specific domain, represented in a flat or
hierarchical format.
Technological Singularity
See "Intelligence Explosion." 


Temporal Data 
The data that is captured at different times. 


Temporal Difference (TD) Learning 

An unsupervised learning technique used in reinforce- 
ment learning in which an agent learns to predict the 
value of a variable at the end of a sequence of states. 


Tensor 

A multi-dimensional data structure (array) with a
uniform type.

Tensor Degree 

See "Tensor Rank." 


Tensor Network Theory (TNT) 

A numerical approach that provides effective
storage and processing of data with high-dimensional
complexity.

Tensor Order 

See "Tensor Rank." 


Tensor Processing Unit (TPU) 

An application-specific integrated circuit (ASIC)
designed to optimize the performance of machine
learning processing.

Tensor Rank 

The number of dimensions in a tensor. Also called 
"Tensor Order" or "Tensor Degree." 


Tensor Shape 
The number of elements in each dimension of
a tensor.


Tensor Size 
The total number of elements in a tensor. 
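
To make tensor rank, shape, and size concrete, the sketch below computes all three for a tensor stored as regular (non-ragged) nested Python lists. The helper names are hypothetical; array libraries expose the same quantities as built-in attributes.

```python
import math


def tensor_shape(tensor):
    """Number of elements along each dimension, from outermost to innermost."""
    shape = []
    while isinstance(tensor, list):
        shape.append(len(tensor))
        tensor = tensor[0] if tensor else None
    return tuple(shape)


def tensor_rank(tensor):
    """Number of dimensions."""
    return len(tensor_shape(tensor))


def tensor_size(tensor):
    """Total number of elements: the product of the shape entries."""
    return math.prod(tensor_shape(tensor))


matrix = [[1, 2, 3], [4, 5, 6]]
summary = (tensor_rank(matrix), tensor_shape(matrix), tensor_size(matrix))
```

A 2-by-3 matrix therefore has rank 2, shape (2, 3), and size 6, while a scalar has rank 0, an empty shape, and size 1.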


TensorFlow

An open-source platform for machine learning 
applications. 

Termination Condition 

A condition used in reinforcement learning to de- 
termine when an episode ends. 


Test Dataset (or Test Data, Test Set) 

The data used to evaluate the performance of 
a machine learning model after the training and 
validation phases. Also called "Evaluation Data." 
Testing Error

The error that occurs when running a model on 
new data. 

Testing Example 

An example in a testing dataset that consists of 
one or more features and sometimes a label. 


Text Mining 

The process of applying data mining techniques 
to extract patterns and insights from textual doc- 
uments. 

Text to Speech (TTS) 

See "Speech Synthesis." 


Text Tokenization 

The process of dividing up text into smaller units 
called tokens. 

Thompson Sampling 

A heuristic learning algorithm that performs actions 
sequentially to balance between maximizing immedi- 
ate performance and accumulating new information 
that could improve performance in the future. 

Time Complexity 

The amount of time to run an algorithm. 


Time Series Analysis 

A method of analyzing data gathered over a time 
interval. 

Time Stamp (or Timestamp) 

A specific time and date registered to a digital 
file to record when data was created, deleted, 
changed, or transmitted. 

Token 

A meaningful language unit (word, subword, or 
character) that a model uses for training and mak- 
ing predictions. 

Topic Modeling 

The use of statistical tools or machine learning to 
discover semantic topics in a collection of docu- 
ments. 

Torch 

An open-source scientific computing framework 
with broad support for machine learning algorithms, 
written in C and Lua programming languages. 
Trained Model 

The result of training a model. 
Training

The process of determining or optimizing the pa- 
rameters comprising a model based on a machine 
learning algorithm using training data. Also called 
"Model Training." 

Training Dataset (or Training Data, Training Set) 

The data that is given to a machine learning model 
during the training phase. 

Training Error 

The error that occurs when running a model on 
the training data. 

Training Example 

An example in a training dataset that consists of 
one or more features and sometimes a label. 


Trajectory 

A sequence of tuples in reinforcement learning 
that represent a sequence of the agent's state 
transitions. 

Transaction Processing 

A type of computing that divides work into individual
and indivisible operations to support interactive
applications.

Transactional Data 

A form of data that is dynamic in nature and changes
over time.


Transfer Learning 

A machine learning technique in which the algo- 
rithm learns to perform a specific task and then 
uses that knowledge to perform other tasks. 
Transformer 

A neural network architecture for processing se- 
quential data based on a self-attention mecha- 
nism. Also called "Transformer Neural Network." 


Transformer Neural Network 
See "Transformer." 


Transhumanism 

A social and philosophical movement to understand
and evaluate the opportunities of enhancing
the human condition and ability through advanced
technologies.

Translational Invariance 

The ability of an algorithm to correctly classify 
images even when the objects’ position inside the 
image changes. 
Transliteration

The process of representing words and phrases 
in one language using the alphabets of another 
language while maintaining their pronunciation. 


Transparency 

A property of a system to communicate the appro- 
priate information about the system to stakeholders, 
including features, components, and procedures. 


Transposed Convolution 

A technique that allows upsampling a feature vec- 
tor from a minimized dimensionality into a larger 
one. Also called "Fractionally Strided Convolution" 
or "Deconvolution." 


Tree Traversal 
A process of visiting all nodes in a tree. 


Trigram 
A sequence of three words in a text. 
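
As a short illustration, the trigrams of a text can be listed by sliding a three-word window over it. Splitting on whitespace is a simplifying assumption; real tokenizers also handle punctuation.

```python
def trigrams(text):
    """Return every run of three consecutive words in the text."""
    words = text.split()
    return [tuple(words[i:i + 3]) for i in range(len(words) - 2)]


result = trigrams("the cat sat on the mat")
```

A text with fewer than three words yields an empty list.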


True Negative (TN) 
A result that the model correctly predicted as negative.


True Negative Rate (TNR)
See "Specificity." 


True Positive (TP) 
A result that the model correctly predicted as positive. 


True Positive Rate (TPR) 
See "Recall." 


Trust 
The level of confidence a user or stakeholder has 
in a system or product to perform as intended. 


Trustworthiness 
The ability to meet expectations of the stakehold- 
ers in a verifiable manner. 


Tuple 

A record (or row) in a relational database. Also 
used as a synonym of "Feature Vector" in machine 
learning. 

Turing Test 

A test developed by Alan Turing to evaluate a ma- 
chine’s ability to show intelligent behavior equiv- 
alent to human intelligence. 


U 

Unawareness 

A case in which sensitive attributes are available 
but not used in the training data. 




Underfitting 

A machine learning problem that occurs when the 
model fails to adequately model the complexity 
of the training data, resulting in poor predictive 
ability. 

Undersampling 

A technique used to balance a class-imbalanced 
dataset by removing examples from the majority 
class. Also called "Downsampling." 

Unfriendly Artificial Intelligence (UFAI) 

A hypothetical Artificial General Intelligence (AGI)
that would be harmful to humanity rather than
beneficial.

Unidirectional 

A text-analysis system that assesses only the text
that comes before a specific part of a text.


Unidirectional Language Model 

A model that defines the probability of a word's
presence in a text based on the text that comes
before it. Also called "Causal Language Model." 
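
In standard notation (the symbols w_t and n are introduced here for
illustration and are not part of this glossary), such a model
factorizes the probability of a text of n words from left to right,
so each word is conditioned only on the words before it:

```latex
P(w_1, \dots, w_n) = \prod_{t=1}^{n} P(w_t \mid w_1, \dots, w_{t-1})
```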


Universal Approximation Theorem 

A fundamental theorem that states that neural
networks can approximate any continuous function
under an appropriate setting.


Unlabeled Example 
An example that includes features but no label. 


Unstructured Data 
The data that does not have any structure or is not 
organized in a pre-defined manner. 


Unstructured Data Mining 

The process of discovering patterns from a large 
amount of unstructured data and extracting useful 
information. 


Unsupervised Anomaly Detection 
See "Outlier Detection."


Unsupervised Learning 
A type of machine learning that discovers patterns
in an unlabeled dataset. 


Upsampling 
See "Oversampling." 


Upweighting 
The process of adding weight to the downsampled 
class proportional to the factor of the downsampling. 
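
A minimal sketch of this, assuming a hypothetical helper `upweight`
and per-example weights that a training loop would multiply into
the loss (both are illustrative assumptions, not part of this
glossary):

```python
def upweight(labels, downsampled_label, factor):
    """Give each example of the downsampled class a weight equal to the factor."""
    return [float(factor) if y == downsampled_label else 1.0 for y in labels]

# Illustrative case: class 0 was downsampled by a factor of 9,
# so each remaining class-0 example stands in for 9 originals.
weights = upweight([0, 0, 1, 1, 1], downsampled_label=0, factor=9)
```

Here each remaining example of class 0 receives a weight of 9.0,
while examples of the other class keep a weight of 1.0.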


User

An individual or group that interacts with a system 
or benefits from it. 

User Matrix 

A matrix in recommendation systems that contains 
latent factors about user preferences. 


Utterance 

Any input from a human user to a conversational
AI system, such as a chatbot or a speech
recognition system.


Validation

A process used during training to assess the
quality and performance of a machine learning model
using a validation set.

Validation Dataset (or Validation Data, Validation Set) 
The data used to evaluate the performance of a 
machine learning model after the training phase. 


Vanishing Gradient Problem 

A case when the gradients of early hidden layers
of a deep neural network become flat (close to
zero), which makes those layers slow to train.
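
The effect can be illustrated with a simplified sketch: a chain of
sigmoid layers that are one unit wide, where the backpropagated
gradient is a product of per-layer factors. The function name
`gradient_scale` and the choice of weight and pre-activation values
are assumptions made for illustration only.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def gradient_scale(depth, weight=1.0, pre_activation=0.0):
    """Size of the gradient factor after passing back through `depth` layers."""
    s = sigmoid(pre_activation)
    # Derivative of one layer's output with respect to its input.
    # The sigmoid derivative s * (1 - s) is never larger than 0.25.
    local = weight * s * (1.0 - s)
    return abs(local) ** depth

shallow = gradient_scale(1)   # factor reaching the layer next to the output
deep = gradient_scale(10)     # factor reaching a layer ten steps earlier
```

Because each factor is at most 0.25 in this setting, the product
shrinks quickly with depth, so the earliest layers receive almost
no training signal.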


Vertical Scaling 
A method of increasing the data processing
performance by improving processors, memory, or
storage.


Vision Processing Unit (VPU) 

A microprocessor designed to accelerate computer
vision tasks.

Visual Question Answering (VQA) 

A task of answering particular questions regarding
an input image. 

Visual Recognition 

See "Image Recognition." 


Visual Search 
The use of an image or video to query similar
content visually.


Wasserstein Loss

A loss function used in generative adversarial
networks based on the Earth-Mover's Distance (EMD)
between two distributions.
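
One common formulation, sketched here under assumptions, has the
critic output an unbounded score for each example. The function
names `critic_loss` and `generator_loss` and the sign convention
(both losses are minimized) are illustrative choices; the sketch
also omits the constraint on the critic, such as weight clipping or
a gradient penalty, that this approach requires in practice.

```python
from statistics import fmean

def critic_loss(real_scores, fake_scores):
    """The critic tries to score real examples higher than generated ones."""
    return fmean(fake_scores) - fmean(real_scores)

def generator_loss(fake_scores):
    """The generator tries to raise the critic's score for generated examples."""
    return -fmean(fake_scores)

# Illustrative critic scores for two real and two generated examples.
c_loss = critic_loss(real_scores=[1.0, 3.0], fake_scores=[0.0, 1.0])
g_loss = generator_loss(fake_scores=[0.0, 1.0])
```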


Weak Artificial Intelligence
See "Narrow Artificial Intelligence (NAI)."


Web Analytics 
A method for gathering, analyzing, and reporting
website usage data.


Web Crawler 

A bot operated by search engines to browse and 
index the content of the websites available on the 
Internet. Also called "Web Spider." 


Web Scraper 
A computer program that extracts information
from websites. 


Web Scraping 
The process of extracting information from websites. 


Web Spider 
See "Web Crawler." 


Weight 

A value used as a coefficient for a linear model 
feature or a neural network connection between 
neurons. 

Weight Initialization 

The process of setting initial weights for a neural 
network. 
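
As one example among several schemes, the sketch below draws
weights with Glorot (Xavier) uniform initialization, which samples
from a range that depends on the layer's input and output sizes.
The function name `glorot_uniform` and its parameters are chosen
here for illustration.

```python
import math
import random

def glorot_uniform(fan_in, fan_out, seed=0):
    """Return a fan_in x fan_out weight matrix drawn from U(-limit, limit)."""
    rng = random.Random(seed)
    # Glorot uniform limit: sqrt(6 / (fan_in + fan_out)).
    limit = math.sqrt(6.0 / (fan_in + fan_out))
    return [[rng.uniform(-limit, limit) for _ in range(fan_out)]
            for _ in range(fan_in)]

# Illustrative layer with 4 inputs and 3 outputs.
weights = glorot_uniform(fan_in=4, fan_out=3)
```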

Weighted Alternating Least Squares (WALS) 

An algorithm used in recommendation systems 
for minimizing the objective function during matrix 
factorization. 

Wide Model 

A type of neural network with many input features 
that connect directly to the output node. 


Workload 
A combination of tasks that run on a given
computer system.


Zero-Shot Learning
An approach that enables the model to recognize
classes never seen during training. 